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O units thereof. The present invention also relates to the use of hetero-oligomeric and chimeric taste receptors comprising TlRl A'lRS 
O and T1R2/T1R3 in assays to identify compounds that respectively respond to umami taste stimuli and sweet taste stimuli. Further, 
^ the invention relates to the constitutive of cell lines that stably or transiently co-express a combination of TlRl and T1R3; or T1R2 

and T1R3; under constitutive or inducible conditions. The use of these cells lines in cell-based assays to identify umami and sweet 
^ taste modulatory compounds is also provided, particularly high throughput screening assays that detect receptor activity by use of 

fluorometric imaging. 
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TIR HETERO-OLIGOMERIC TASTE RECEPTORS, CELL LINES 
THAT EXPRESS SAID RECEFIORS, AND TASTE COMPOUW 

Cross Reference to Related Applications 

5 This application claims priority to U.S. Provisional Application Serial No 

60/494,071 jfiled on August 6, 2003, and U.S. Provisional Application Serial No 
60/552,064 filed March 9, 2004, bofli of which are incorporated by reference in their 
entirety. 

Background of the Invention 

10 Field of the Invention 

The present invention; in part relates to the discovery that the TIR receptors 

assemble to form functional taste receptors. Particularly, it has been discovered that 

co~expression of TlRl and T1R3 results in a taste receptor that responds to umami 

taste stimuli, including monosodium glutamate. Also, it has been discovered that co- 
1 5 expression of the T1R2 and T1R3 receptors results in a taste receptor that responds to 

sweet taste stimuli including naturally occurring and artificial sweeteners. 

Also, the present invention relates to the use of hetero-oligomeric taste receptors 

comprising T1R1/T1R3 and T1R2/T1R3 in assays to identify compounds that 

respectively respond to umami taste stimuli and sweet taste stimuli. 
20 The invention also relates to chimeras and truncated versions of TlRl , T1R2, 

and T1R3, as well as chimeras of T1R1/T1R3 and T1R2/T1R3 receptors comprising 

human, rat, or human and rat subunits. 

Further, the invention relates to the construction of cell lines that stably or 

transiently co-express a combination of TlRl and T1R3; or T1R2 and T1R3, including 
25 truncated or chimeric versions of these subunits as well as chimeric receptors 

comprising wild-type or chimeric subimits; under constitutive or inducible conditions. 
The use of these cell lines in cell-based assays to identify umami and sweet taste 

modulatory compounds is also provided, particularly high throughput screening assays 

that detect receptor activity by the use of fluorometric imaging. 
30 The invention also relates to compounds that bmd to T1R1/T1R3, T1R2/T1R3 

receptors, as well as TlRl, T1R2, and T1R3 chuneric and truncated subunits and 

chimeric receptors. 
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Description of the Related Art 

The taste system provides sensory information about the chemical composition 
of the extemal world. Mammals are believed to have at least five basic taste 
5 modalities: sweet, bitter, sour, salty, and umami. See, e.g., Kawamura et aL, 

Introduction to Ufnami: A Basic Taste (1987); Kinnamon et al, Ayin, Rev, Physioly 
54:715-31 (1992); Lindemaun, Physiol Rev., 76:718-66 (1996); Stewart et ah. Am. J. 
Physiol, 272:1-26(1997). Each taste modality is thought to be mediated by a distinct 
protein receptor or receptors that are expressed in taste receptor cells found on the 
10 surface of the tongue (Lindemann, Physol. Rev. 76:718-716 (1996)). The taste 

receptors that recognize bitter, sweet, and iraiami taste stimuli belong to the G-protein- 
coupled receptor (GPCR) superfamily (Hoon et al. Cell 96:451 (1999); Adler et al. 
Cell 100:693 (2000)). (Other taste modalities are believed to be mediated by ion 
channels.) 

15 G protein-coupled receptoiis mediate many other physiological functions, such 

as endocrine function, exocrine function, heart rate, lipolysis» and carbohydrate 
metabolism. The biochemical analysis and molecular cloning of a number of such 
receptors has revealed many basic principles regarding the function of these receptors. 
For example. United States Patent No. 5,691,188 describes how upon a Ugand binding 

20 to a GPCR, the receptor undergoes a conformational change leading to activation of a 
heterotrimeric G protein by promoting the displacement of bound GDP by GTP on the 
surface of the Ga subimit and subsequent dissociation of the Ga subunit from the Gp 
and Gy subunits. The free Ga subunits and GPy complexes activate downstream 
elements of a variety of signal transduction pathways. 

25 The TIR receptors were previously hypothesized to function as sweet taste 

receptors (Hoon et al.. Cell 96:541-51 (1999); Kitagawa et al., Biochem BiopJtys Res. 
Commun. 283:236-42 (2001); Max et al., Nat. Genet. 28:58-63 (2001); Montmayeur et 
al., Nat. Neurosci: 4: 412-8 (2001); Sainz et al., /. Neurochem. 77: 896-903 (2001)), 
and Nelson et al. (2001) and Li et al (2002) have recently demonstrated tfiat rat and 

30 human, respectively, T1R2 and T1R3 act in combination to recognize sweet taste 
stimuU. 

However, there remains in the art a need for new and improved flavoring 
agents. For exsaxuple, one of the five known basic tastes is the '"savory" or '^umami'" 
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flavor of monosodium glutamate CMSG")- MSG is known to produce adverse 
reactions in some people, but very little progress has been made in identifying artificial 
substitutes for MSG. It is known that a few naturally occurring materials can increase 
or enhance the effectiveness of MSG as a savory flavoring agent, so that less MSG 
5 would be needed for a given flavoring application. For example the naturally occurring 
nucleotide compounds inosine monophosphate (MP) or guanosine monophosphate 
(GMP) are known to have a multiplier effect on the savory taste of MSG, but IMP and 
GMP are very difficult and expensive to isolate and purify from natural sources, or 
synthesize, and hence have only limited practical application to most commercial needs 
10 in food or medicinal compositions. Less expensive compounds that would provide the 
flavor of MSG itself, or enhance the effectiveness of any MSG that is present could be 
of very high value. Similarly, discovery of compounds that are either new **High 
Intensity' sweeteners (i.e. they are many times sweeter than sucrose) would be of 
value. 

15 What is needed in the art is the identification and characterization of taste 

receptors which fimction as sweet and umami receptors, assays for identifying 
compounds that modulate (enhance or block) sweet and umami taste, and the 
compounds that specifically bind to these receptors. 

20 Summary of the Invention 

The present invention provides chimeric receptors comprising various 
combinations of human and rat TIRs, such as a chimeric T1R2/T1R3 receptor 
comprising a human T1R2 subunit and a rat T1R3 subimit; a chimeric T1R2/T1R3 
receptor comprising a rat T1R2 subunit and a human T1R3 subunit; a chimeric T1R2 
25 receptor subunit comprising a human extracellular domain, a rat transmembrane 
domain and a rat intracellular domain; and a chimeric T1R3 receptor subimit 
comprising a rat extracellular domain, a himian transmembrane domain and a human 
intracellular domain. 

The present invention also provides compounds that specifically bind to TlRl, 
30 T1R2, T1R3, T1R1/T1R3 and T1R2/T1R3, or isolated subunits, fragments, chimeras or 
truncated versions thereof as disclosed herein. 

The present invention relates to the discovery that different combinations of 
TIRs, when co-expressed, produce functional taste receptors that respond to taste 
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stimuli. Particularly, the present invention relates to the discovery that co-exptession of 
T1R2 and T1R3 results in a hetero-oligoraeric taste receptor that responds to sweet 
taste stimuli. Also, the present invention relates to the discovery that the co-expression 
of TlRl and T1R3 results in a hetero-oligomeric taste receptor that responds to umami 
S taste stimuli such as monosodium glutamate. 

The present invention also relates to cell lines that co-express TlRl and T1R3, 
including human or rat, or T1R2 and T1R3, including human or rat. In preferred 
embodiments these cell lines will express elevated, amounts of the receptors, either 
constitutively or inducibly. These cell lines include cells that transiently or stably 
10 express TlRl and T1R3 or T1R2 and T1R3, 

Also, the present invention provides assays, preferably high througihput 
screening assays, that utilize the TlR2/riR3 taste receptor, or the T1R1/T1R3 receptor, 
preferably high throughput cell-based assays, to identify compounds that modulate 
sweet or uitiami taste. The invention also provides assays that include taste tests to 
IS confirm that these compounds modulate sweet or umami taste. 

The invention also relates to compounds that bind to the N-terminal 
extracellular domain of T1R2, compounds that bind to the cysteine-rich domain of 
T1R2, compounds that bind to the Transmembrane Domain of T1R2, compounds that 
bind to the Transmembrane Domain of T1R3, compounds that bind to the 
20 Transmembrane Domain of T1R2 of a truncated receptor h2TM/h3TM, and compourids 
that bind to the lYansmembrane Domain of T1R3 of a truncated receptor h2TM/h3TM, 
for example. 

Brief Descriptfon of the Figures 
25 Figure 1 contains a sequence alignment of human and rat TlRs, human 

calcium-sensing receptor and rat metabotropic glutamate receptor. 

Figure 2 contains RT-PCR amplification e^erimental results which show 
that hTlR2 and hTlR3 are expressed in taste tissue. 

Figure 3a - 3b contam functional data (mtracellular calcium responses) 
30 elicited by different sweet taste stimuli in HEK cells stably expressing Gais that are 
transiently transfected with human T1R2, T1R3 and T1R2/T1R3 at various 
concentrations of sweet taste stimuli (Figure 3a); human T1R2/T1R3 dose responses 
for several sweet taste stimuli (Figure 3b); human T1R2/T1R3 responses to sucrose in 
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the presence of gurmarin, and endogenous p2-adrenergic receptor responses to 
isoproterenol in the presence of gunnarin. Figure 3c contains the normalized response 
to different sweeteners. 

Figure 4 contains intracellular calcium responses in HEK cells stably 
5 expressing Gal 5, transiently transfected with hTlR2/hTlR3, rTlR2/it 1R3, 

hTlR2/rTlR3 and rTlR2/hTlR3 in response to 350 mM sucrose, 25 mM tryptophan, 
15 mM aspartame, and 0.05 % monellin. 

Figure 5 contains the results of a fluorescence plate reactor based assay 
wherein HEK cells stably expressing Gal 5 were transiently transfected with hTlR2 
10 and hTlRS or hTlR3 alone and contacted with the calcium dye Fluo-4 and a sweet 
taste stimulus (12.5 mM cyclamate). 

Figure 6 contains normalized dose-response curves which show that hTlR2 
and hTlR3 function in combination as the human sweet receptor based on their dose- 
specific interaction with various sweet stimuli (tip, cyclamate, sucrose, neotame, 
15 asparame, saccharin and Acek), 

Figure 7 contains structural information relating to mGluRl and TlRl 
showing the key hgand binding residues are observed in these molecides. 

Figure 8a-8c contains functional data showing HEK cells which stably 
express Gal 5 that are transiently transfected with T1R1/T1R3 respond to glutamate in 
20 an intracellular calcium-based assay. Figure 8a shows that intracellular calcium 
increases in response to increasing glutamate concentration; Figure 8b shows 
intracellular calcium responds to IMP (2 mM), glutamate (0.5 mM) and 0.2 mM IMP; 
and Figure 8c shows human T1R1/T1K3 responses for glutamate in the presence and 
absence of 0.2 mM IMP. 
25 Figures 9a-9b respectively contain the results of an immunofluorescence 

staining assay using Myc-tagged hTlR2 and a FACS experiment showing that the 
incorporation of the PDZEP peptide (SEQ ID No: 1) enhanced the expression of a TIR 
(hTlR2) on the plasma membrane. 

Figure 10a through 10b contain calcium imaging data demonstrating that 
30 hlTR2/hTlR3 respond to different sweet stimuli. 

Figure 1 1 shows the responses of cell lines which stably express hTlRl/hTlR3 
by automated fluorescence imagmg to umami taste stimuli. 
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Figure 12 shows the responses of a cell line which stably expresses 
hTlR2/hTlR3 by automated fluorescence imaging to sweet taste stimuli. 

Figure 13 shows dose-response curves determined using automated 
fluorescence imaging for a cell line that inducibly expresses the human T1R1/T1R3 
5 taste receptor for L-glutamate in the presence and absence of 0.2mM IMP. 

Figures 14 and 1 5 show the response of a cell line that inducibly expresses the 
human T1R1/T1R3 taste receptor (1-17 clone) to a panel of L-amino acids, hi Figure 
14 different C-amino acids at lOmM were tested in the presence and absence of 1 mM 
IMP. In Figure 15 dose-responses for active amino acids were determined in the 
10 presence of 0.2mM IMP. 

Figure 16 shows that lactisole inhibits the receptor activities of human 
T1R2/T1R3 and human T1R1/T1R3. 

Figure 17 shows schematics of human-rat TIR chmieras. The chimeras are 
constructed by fusing the human or rat extracellular domains to the rat or human 
15 transmembrane doniains respectively, as shown in h2-i2, r2-h2, h3-r3 and rS-hS; 

Figure 18 shows neohesperidin dihydrochalcone (NHDC) enhances the 
activities of T1R1/T1R3 umami taste receptor. [Neohesperidin dihydrochalcone] = 5 
^M. The glutamate dose response curve is left-shifted by 2.3 fold (left panel), and the 
glutamate/IMP dose response is left-shifted by 2. 1 fold. 
20 Figure 19 shows that control sweeteners do not affect the activities of 

T1R1/T1R3 umami taste receptor [Steviocide] = 0.5 mM. [Saccharin] = 1 mM. 
Glutamate dose response is shown in the left panel, and glutamate/IMP dose response is 
shown in the right panel. 

Figure 20 shows NHDC maps to the transmembrane domain of human T1R3. 
25 Figure 21 shows mapping of a compound to the human T1R2 transmembrane 

domain. 

Figures 22a-d show sweeteners which map to different domains/subunits of the 
human sweet receptor. Figure 22a shows responses of human and rat sweet receptors to 
sucrose (200 mNQ, aspartame (10 mM), neotame (0.1 mM), cyclamate (10 ruM), and 
30 sucrose (200 rxM) in the presence of lactisole (1 mM) (Suc/Lac). HEK-293T cells were 
transiently transfected with human or rat T1R2, T1R3, and a Gais chimera Gais/ti, and 
assayed for intracellular calcium inareases in resfponse to sweetoaers. Figure 22b shows 
aspartame and neotame were mapped to N- terminal extracellular domain of human 
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T1R2. Combinations of TIR chimeras were transiently transfected into HEK-293T cells 
with Gai5/u> and assayed for responses to sweeteners at the concentrations listed in 23a 
The presence or absence of response is what is important. Figure 22c shows cyclamate 
was mapped to the C-terminal transmembrane domain of human T1R3. Figure 22d 
5 shows lactisole was mapped to the transmembrane domain of human T1R3. Different 
combinations of TIR chimeras were transiently transfected into HEK-293T cells with G 
Gai5/ii, and assayed for responses to sucrose (200 mM) and AceK (10 mM) in the 
absence or presence of lactisole (1 mM). The activities in B, C and D represent the 
mean ± SB of number of responding cells for four imaged field of --1 ,000 confluent 
10 cells. 

Figures 23a-d show mutations in T1R2 or T1R3 selectively affect the activity of 
different sweeteners. Figure 23a shows sequence aligmnent of the N-tenninal ligand 
binding domain of rat mGluRS with human and rodent TlR2s. The 8 critical amino 
acids involved in ligand-binding in mGluR5 are labeled with three of the 8 amino 

1 5 acids are conserved in T1R2 and imderlined. Figure 23b shows two point mutations in 
the human T1R2 N-terminal extracellular domain that abolish response to aspartame 
and neotame without aflfecting cyclamate. Stable cell lines of hTlR2/hTlR3 (WT), 
hTlR2 S144A/hTlR3 (S144A) and hTlR2 E302A/hTlR3 (E302A) were generated as 
describe in the Examples. The dose-responses of these stable lines were determined on 

20 FLIPR for sucrose, aspartame, neotame and cyclamate. The activities represent the 

mean ± SB of fold increases in fluorescence intensities for foiu: recorded wells. Figure 
. 23c shows sequence alignment of human and rodent T1R3 transmembrane domains. 
The three extracellular loops are underlined and labeled ELI, 2, or 3, according to 
their order in the protein sequences. Figure 23d shows mutations in the extracellular 

25 loop of hTlR3 that aboUsh response to cyclamate without affecting aspartame. Each 
of the three extracellular loops of hTlR3 were replaced with rat protdn sequence 
separately, and the resulting hTlR3 mutants were transiently transfected into HEK- 
293T cells together with G am\, and assayed for responses to sucrose (200mM), 
aspartame (10 mM) and cyclamate (10 mM). The activities i^resent the mean ± SE of 

30 number of responding cells for four imaged field of ~1,000 confluent cells. 

Figures 24a-b show human T1R2 is required for Gais -coupling. Figure 24a 
shows responses of human, rat and chimeric sweet recqptors to sucrose (200 mM) 
and AceK (10 roM). Stable Gais cells were transiently transfected with human, rat or 
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chimeric TlRs, and assayed for intracellular calcium increases in response to 
sweeteners. Figure 24b shows G ois-coupling is mediated by human T1R2. The 
activities represeiit the mean ± SB of number of responding cells for four imaged 
jSeld of -1,000 confluent cells. 
5 Figures 25a-f show the effect of lactisole and cyclamate on the human 

T1R1/T1R3 umami receptor. Figure 25a shows the response of human T1R1/T1R3 
stable cell line to L-glutamate (5 mM) and L- glutamate/IMP (1/0.2 mM) in the absence 
and presence of lactisole (5 mM). Figure 25b shows the lactisole dose-dependent 
inhibition curves were determined for L-glutamate (Glu), and L- glutamate with 0.2 
1 0 mM IMP (Glu/IMP), each at two different concentrations. The ICSOs are 0. 19 ± 0.02 
mM and 0.21 ± 0.01 mM for L-glutamate at 8 and 80 mM; 0.35 ± 0.03 mM and 0.82 ± 
0.06 mM for L-glutamate with IMP at 0.8 and 8 mM respectively. Figure 25c shows the 
dose responses for L-glutamate, with or witiiout 0.2 mM IMP, were determined iii the 
presence of different concentrations of lactisole. In the presence of 0, 25, or 50 fiM 
15 lactisole, the EC50s are 9.9 ± 1.5 mM, 7.9 ± 0.5 mM, and 7.0 ± 0.3 mM for 

glutamate; in the presence of 0, 100, or 200 |iiM lactisole, the ECsos are 0.53 ± 0,04 
mM. 0.71 ± 0.10 mM, and 0.84 ± 0.10 mM for L-glutamate with IMP. Values represent 
the mean ± SE for four indq)endent responses. Figure 25d shows the detection 
thresholds for sweet, umami, and salty taste stimuli were determined in the presence or 
20 absence of lactisole. The inhibition ejBFect of lactisole is shown as fold increases in 
detection fluresholds. *T)etection thresholds" are defined as the lower limit of detectable 
tastants. The detection threshold values were averaged over four trials for three subjects. 
Figure 25e shows the responses of human T1R1/T1R3 stable cell line to threshold level 
of L-glutamate (4 mM) and endogenous M2 receptor agonist carbachol were assayed on 
25 FLIPR in the absence and presence of various concentrations of cyclamate. Figure 25f 
shows the dose-responses of the human T1R1/T1R3 stable cell line were determined on 
FLIPR for L-glutamate with or without 0.2 mM IMP in the absence and presence of 
cyclamate (8 mM). The activities in B, C, E and F represent the mean ± SE of fold 
increases in fluorescence intensities for four recorded wells. The dose- responses in B, 
30 C, E and F were reproduced at least 6 times independently. 

Figure 26 shows a working model for the sweet and umami taste receptor 
structure-fimctipn relationships. Filled arrows indicate direct activation, open arrows 
indicate enhancement, and bar heads indicate inhibition. 
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Figure 27a shows aU 16 combinations of TlRs and chimeras that were tested 
for responses to sweeteners and lactisole. iTlR2/TlR3H-R, rTlR2/hTlR3, and T1R2H- 
R/T1R3R-H show a significant response to cyclamate and they can be inhibited by 
lactisole. TIR chimeras were transiently transfected into HEK-293T cells with Gais/ii* 
S The activities represent the mean i: SE of number of responding cells for four imaged 
field of --1,000 confluent cells, each unit on the Y axis represents 50 responding cells. 
Abbreviations: Sue (sucrose lOOmM); Suc/Lac (sucrose lOOmM, lactisole 1 mM); 
AceK (acesulfame K 10 uM); AceK/Lac (acesulfame K 10 mM, lactisole 1 mM); 
ATM (aspartame lOmM); NTM (neotame 10 mM); Cyc (cyclamate 10 mM). Figure 

10 27b shows the lactisole dose-dependent inhibition curves of the human sweet receptor 
were determined for sucrose (Sue), saccharin (Sac), and D-tryptophan (D-Tip), each at 
two different concentrations. The IC50s are 19.6 ± 0.1 |iiM and 64.6 ± 0.3 fxM for 
sucrose at SO mM and 120 mM; 22.6 ± 0.1 ^M and 103 =b 7 pM for saccharin at 0.1 and 
2 mM; 19.9 ± 0.2 jiM and 168 ± 9 fiM for D-tryptophan respectively. Figure 27c shows 

1 S the dose responses of human sweet receptor for sucrose, D-Tip and saccharin were 

determined with different concentrations of lactisole. In the presence of 0, 10, or 20 ^M 
lactisole, the ECsos are 19.4 ± 0.9 mM, 24.7 ±1.0 mM, and 31.3 ± 0.3 mM for sucrose; 
0.37 ± 0.02 mM, 0.60 ± 0.03 mM, 0.94 ± 0.08 mM for D-Trp; 42 d= 3 fiM, 67 ± 6 fiM, 
118^2 ^M for saccharin. Values represent the mean ± SE for four independent 

20 responses. The dose-responses in B and C were determined at least 6 times 
independently, and generated similar results as shown here. 

Detailed Description of the Invention 
The inventions provides compounds that specifically bind to the wild-type and 
25 chimeric sweet and umami taste receptors disclosed herein. Further provided are 

compounds that specifically bind to the wild-type, chimeric or truncated T1R2 or T1R3 
subunits of the sweet and umami receptors. 

Binding to the T1R2/T1R3 sweet receptor defines a large genus of molecules. 
The receptor responds to every sweetener tested, including carbohydrate sugars, amino 
30 acids and derivatives, sweet proteins, and synthetic sweetmers. In the meantime, the 
receptor exhibits stereo-selectivity for certain sweeteners, for example, it responds to D- 
tryptophan but not L-tryptophan, which is in correlation with taste physiology data. 
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Thus, the compounds of the invention spedjGically bind chimeric recq)tors. 
Examples include, but are not limited to, a chimeric T1R2/T1R3 receptor comprising a 
human T1R2 subunit and a rat T1R3 subunit; a chimeric T1R2/T1R3 receptor 
conq>rising a rat T1R2 subunit and a human T1R3 subunit; a chimeric T1R2 receptor 
5 subunit comprising a human extracellular domain, a rat transmembrane domain and a 
rat intracellular domain; and a chimeric T1R3 receptor subunit comprising a rat 
extracellular domain, a human transmembrane domain and a human intracellular 
domain. The mvention provides functional taste receptors, preferably human taste 
receptors, that are produced by co-expression of a combination of di£Ferent TlRs, 
10 preferably T1R1/T1R3 or T1R2/T1R3, and the corresponding isolated nucleic acid 
sequences or fragments, chimeras, or variants thereof that upon co-expression result in 

a functional taste receptor, i,e., a sweet taste receptor (T1R2/T1R3) or umami taste 
receptor (TlRimR3). 

TlRs, a fiimily of class C G protein-coupled receptors (GPCRs), are selectively 
15 expressed in the taste tissue (Hoon, MA., et al.. Cell, 1999. 96(4): p. 541-51, 

Bachmanov, A.A., et al., C3nem Senses, 2001. 26(7): p. 925-33, Montmayeur, J.P., et al., 
Nat Neurosd, 2001. 4(5): p. 492-8, Max, M, et al., Nat Genet, 2001. 28(1): p. 58-63, 
BCitagawa, M., et al., Biochem Biophys Res Conamun, 2001. 283(1): p. 236-42 and 
Nelson, G., et al., CeU, 2001. 106(3): p. 381-90.) Functional expression of TlRs in 

20 HEK293 cells revealed that different combinations of TlRs respond to sweet and 

umami taste stunuU (Nelson, G., et al., Cell, 2001. 106(3): p. 381-90, Li, X., et al., Proc 
Natl Acad Sci U S A, 2002. 99(7): p. 4692-6.) T1R2 and T1R3, when co-expressed in 
293 cells, recognize diverse natural and synthetic sweeteners [For the reason mentioned 
above re "diverse", please consider whether we need this section for enablement. If not, 

25 I'd delete. We can discuss], while TlRl and T1R3 recognize umami taste stimulus Lr 
glutamate, and this response is enhanced by 5 '-ribonucleotides, a halhnark of umami 
taste. Knockout data confirmed that TlRs indeed mediate mouse sweet and umami 
tastes (Damak, S., et al.. Science, 2003 301(5634): p. 850-3, Zhao, G.Q., et al., CeU 
2003 Oct 31;115(3):255-66). 

30 The class C GPCRs possess a large N-terminal extracellular domain, offen 

referred to as the Venus flytrap domain (VFD) (Pin, J.P., Pharmacol Ther, 2003 98(3): 
p. 325-54), and are known to function as either homodimers, in the cases of 
metabotropic glutamate receptors (mGluRs) and calcium- sensing receptor (CaR), or 
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heterodimers, in the case of y-amiiiobutyric add typp B receptor (GABAbR). The 
functional expression data shows a heterodimer mechanism for TlRs: both TlRl and 
T1R2 need to be coexpressed with T1R3 to be fimctional, which is supported by the 
overlapping e)q)ression patterns of TlRs in rodent tongue. 
5 It is established herein that TIR family members act in combination with other 

TIR family members to function as sweet and umami taste receptors. As disclosed in 
further detail infra in the experimental examples, it has been demonstrated that 
heterologous cells which co-express hTlR2 and hTlR3 are selectively activated by 
sweet taste stimuli in a manner that mirrors human sweet taste. 
10 For example, HEK-293-Gal5 cells that co-express hTlR2 and hTlR3 

specifically respond to cyclamate, sucrose, aspartame, and saccharin, and the dose 
responses for these compounds correlate with the psychophysical taste detection 
thresholds. 

Also, as supported by data in the experimental examples, it has been shown that 
15 ceUs which co-e5qpress hTlRl and hTlR3 are selectively activated by glutamate 
(monosodium glutamate) and S'*ribonucleotides in a manner that mirrors human 
umami taste. For example, HEK-293-Gal5 cells that co-express hTlRl and hTlR3 
specifically respond to glutamate and the dose response for this umami-tasting 
compound correlates with its psychophysical taste detection threshold. Moreover, 5*- 
20 ribonucleotides such as IMP enhance the glutamate response of the T1R1/T1R3 
receptor, a synergism characteristic of umami taste. 

Further, as shown by experimental data in the examples it has been shown that 
cells which stably and inducibly co-express T1R1/T1R3 selectively respond to the 
umami taste stimuli L-glutamate and L-aspartate and only weakly respond to other L- 
25 amino acids, and at much higher concentrations, providing further evidence that the 
T1R1/T1R3 receptor can be used in assays to identify compounds that modulate 
(enhance or block) umanod taste stimuU. 

Examples of compounds that specifically bind to the sweet recq)tor and 
modulate sweet taste can be found in Table 5. 

30 Tables 1-4 provide examples ofcompounds that specifically bind to tile umami 

receptor and modulate umami taste. 
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Table I'Umanii Amides 



Compound 
No. 



Compound 



Umami 
ECfio (uM) 



Ecso ratio (vs. 
MSG) 



Al 




0.22 



3,6-DichlQro-N-{4-ethoxy-phenyl)-2- 
metfaoxy-benzamide 



2.74 



A2 




0.93 



4-(3,6-DichIoro-2-znethoxy- 
beiizoylanmio)-beiizoic acid methyl 
ester 



6.98 



A3 




1.08 



2,5-dichloro-N-(4- 
ethoxyphenyl)beiizamide 



6.14 



A4 




0.4 



2-[(Beiizo[b]thiophene-2-carbonyl)- 
iamino]-4-methyl-pentanoic acid 
methyl ester 



A5 




0.31 



2-[(Benzofuraii-2-carbonyl)-amino]-4- 
methyl-pentanoic acid methyl ester 



A6 




\ 



0.32 



2-[(5-Meflioxy-benzofuran-2- 
caifoonyl)-anuno]-4-methyl-pentanoic 
acid methyl ester 



2.86 
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Table 1 - Umami Amides 



Compound 
No. 



Compound 



Umami 
EC50 (uM) 



Ecso ratio (vs. 
MSG) 



@ 
(uM) 



A7 




0.46 



(R)-5-methoxy-N-(l-meflioxy-4- 
methylpentan-2-yl)benzofiiran-2- 
carboxamide 



A8 




0.5 



5-methyl-N-(5-methylhexan-3-yI) 
benzofuran-2-caTboxamide 



A9 




0.71 



2-[(Benzo£uran-5-carbonyl)-amino]-4- 
metiiyl-pentanoic acid methyl ester(R)- 
mefhyl 2-(benzofuran-5-carboxamido)- 
4-methylpentanoate 



AlO 




0.91 



N-(heptan-4-yl)-5-methoxybenzofuran- 
2-carboxamide 



4.51 



All 




1.05 



5-chloro-N-(l-methoxybutan-2- 
yl)beiizofuraii-2-carboxamide 



6.5 



0.3 



A12 




1.13 



5-methoxy-N-(2-methylhexan-3- 
yl)benzofuran~2-carboxamide 
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Table 1 - Umami Amides 



Compound 
No. 



Compomid 



Umami 
ECso (iiM) 



Ecso ratio (vs. 
MSG) 



A13 




1.14 



5-mefhoxy-N-(pentaii-3-yl)beDzofuian- 

2-carboxaniide 



4.46 



A14 




1.14 



2-[(5-Methoxy-benzofuran-2- 
carbonyl)-amino]-4-methylsiilfanyIr 
butyric acid methyl ester methyl 2-(5- 
methoxybeiizofuran-2-carboxamido)-4- 
(metfaylthio)butanoate 

,1/- 



A15 




L14 



^ ^ 0 

(lR,2R)-ethyl2-(5- 
methoxybeiizofbiaii-2- 
carboxamido)cyclohexanecarboxylate 



A16 




1.18 



5-methoxy-N-(2-methylpentan-3- 
yl)benzofuran-2-catboxamide ■ 



A17 




12 



^ -0 0 
N-(2,4.dimetifciylpeiitan-3-yl)-5- 
methoxybenzofigan-2-caifaoxamide 



A18 




1.27 



5-methoxy-N-(2-methylheptan-4- 
yl)ben2ofi3ian-2-carboxamde 
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Table 1 - Umami Amides 



Compound 
No. 



Compound 



Umami 
ECso (uM) 



Ecso ratio (vs, 
MSG) 



@ 
tuM) 



A19 




1.3 



5-methoxy-N-( 1 -inethoxypentan-2- 
yl)beiizofuran-2-carboxamide 



A20 




1.32 



5-methyl-N-(2-methylheptan-4-yI) 
ben2ofaran-2-carboxamide 



A21 




N-(pentan-3-yl)benzofuran-2- 
caiboxamide 



1.52 



3.74 



A22 




1.58 



Benzothiazole-6-caiboxylic acid (1- 
propyl-butyl)-amide 



A23 




0.38 



2-methyl-N-(2-methylheptan-4- 
yl)beDzo[d]oxazole-5*carboxamide 



A24 




1.12 



2-methyl-N-(2-metiiy]heptan-4- 
yl)benzo[d]oxa2ole-6-carboxaniide 
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Table 1 - Umami Amides 


Conqpoiind 
No. 


Compound 


Umaml 
ECm(uM) 


Ecso ratio (vs. 
MSG) 


@ 
(uM) 


A25 


(R)-4-Methyl-2-[(2-methyl- 
benzooxazole-6-carbonyl)-ammo]- 
pentanoic acid methyl ester 


1.48 






A26 


2-methyl-N-(2-methyIhexan-3- 
yl)beDZo[d]oxazole-6"Carboxainide 


1.6 






A27 


2-ethyl-N-<heptan-4-yl)beiizo[d] 
oxazole-6-carboxaimde 


1.61 






A28 


— / 

(R). 4-Meti[iyl-2.[(2-methyl- 
beiizooxazole-5-carbonyl}-ainino]- 
pentanoic acid methyl ester 


1.69 






A29 


N-(heptan-4-yl)benzo[d] 
oxazole-6-carboxamide 


1.91 






A30 


X ^0 HN ( 

3-brom(>-N-(heptan-4-yl)funiii-2- 
cailboxaiiiide 


0.49 


12.6 


1 
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Table 1 - Umami Amides 


Compound 
No. 


Compound 


Umami 
ECsofuM) 


Ecso ratio (vs. 
MSG) 


@ 

(vm 


A31 


0 

N-(heptan-4-yl)-4,5-dime 
carboxamide 


jthylfiiranT2- 


0.62 


10.04 


1 


A32 


N-(2,3-dimethylcyclohex 
mefhylfuran-2-cait)0xax 


yl)-3 
mde 


- 


1.15 






A33 


M \ 

4,5-dimethyl-N-(2-methylcyclohexyl) 
furan-2-carboxaniide 


. L33 






A34 


(R)-methyl2-(lH-mdoIc 
carboxamido)-4-niethylpent 


T 

anoate 


0.53 






A35 


cc 

N-(heptan. 
ca 


0 

■4.yl)-lH-in 
iboxaznide 


dole-6- 


0.82 


8.81 


1 
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Table 1 - Umami Amides 



Compound 
No* 



Compound 



Umami 
ECso(uM) 



Ecso ratio (vs. 
MSG) 



@ 
(uM) 



A36 




1.01 



(R)-methyl 2-(lH-indole-5- 
carboxamido)-4-metiiylpentanoate 



A37 




1.5 



(R)-methyI 4-methyl-2-(quinoline-6- 
carboxamido)pentanoate 



A38 




1^2 



5-Methyl-thioplietie-2-carboxylic acid 
(l-propyl-butyl)-amide 



6.54 



A39 




131 



5-Metliyl-tliiopliene-2-carboxylic acid 
(l»2,3,4-tetcahydro-naphtfaalen-;l-yl)- 
amide 



2.3 



A40 




0.37 



(R)-methyl 2-(2-naphthamido)-4- 
methylpeiitanoate 
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Table 1 - Umami Amides 


Compomid 
No. 


Compound 


Umami 
ECso (uM) 


Ecso ratio (vs. 
MSG) 


@ 
(uM) 














A41 






0.7 


2.14 


3 




N-(nonan-5-yl)beEnzo[d][l,3]dioxole-5- 
carboxamide 








A42 


OH 
o 




0.35 








(2R,3R>methyl2- 
(benzo[d][l,3]dioxoIe-5-carboxamido)- 
3-mediylpentanoate 








A43 


0 




0.49 








2-[(Benzo[l,3]dioxole-5-carbonyl)- 
aminol-hexanoic acid mediyl ester 




















A44 




r 


0.61 








(R)-2-[(Benzo[l,3]dioxole-5- 
carbQiiyl)-amiiio]-hexanoic acid methyl 
ester 




















A45 


0 


y"'% 


0.88 








(R)-etfayl 2-(benzo[d][l,3]dioxole-5- 
caiboxamido)-4-mediylpentaiioate 
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Table 1 - Umami Amides 


Compound 
No. 


Compound 


Umami 
ECso (uM) 


Ecjo ratio (vs. 
MSG) 


@ 
(uM) 


A46 


(R)-metiiyl 2-(2,3-dihydrobenzofuran- 
5-carboxamido)-4-methylpeiitanoate 


132 






A47 


(S)-N-(l,2,3,4-tetrahydronaphthaleii-U 
yl)benzord][l ,3]dioxole-5-carboxamide 


133 


6.42 


0.1 


A48 


N-(4-phenylbutan-2-yl)benzo[d] 
[l,3]dioxole-5-caTbo7camide 


1.51 


9.27 


1 


A49 


2-[(Benzo[l,3]dioxole-5-caibonyl)- 
aniino]-pentanoic acid methyl ester 


1.54 


9.53 


1 


A50 


0 

N-(b 


ei]zo[d][ 

Fopyi 


penta 


ioxol-5-yl)-2- 
namide 


1.57 







20 



wo 2005/015158 



PCT/US2004/025459 



Table 1 - Umami Amides 



Compound 
No. 



Compound 



Umami 



Ecso ratio (vs. 
MSG) 



@ 
(uM) 



A51 




1.58 



(R)-propyl 2-(beiizo[d][l,3]dioxoIe-5- 
carboxamido)-4-methylpentanoate 



A52 




1.65 



N-(heptan-4-yl)-2,3- 
dihydrobenzofuran-5-caiboxamide 



A53 




1.83 



N^exan-3-yl)benzo[d][l ,3] 
dioxole-5-carboxamide 



A54 




N-(hexan-3 -yl)-3 -methyl-4- 
(metfaylthio%eii2amide 



0.12 



ASS 




0.12 



methyl 2-(3-chloro^ 
meflioxybeii2ainido)hexanoate 
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Table 1 - Umami Amides 


Compound 
No. 


Compound 


Umami 
ECsoCuM) 


Ecso ratio (vs. 
MSG) 


@ 
(uM) 


A56 


N-(hexan-3-yl)-3,4-imethylben2amide 


0.14 






A57 


(R)-methyl 4-iiiethyl-2-(4- 
vinylbenzamido)pentanoate 


0.18 






A58 


\ 

4-metho3cy-3-methyl-N-(2- 
mediylpentan-3-yl)benzamide 


0.2 






A59 


4-methoxy-3-methyl-N-(2. 
metiiylhBxan-3-yl)benzamide 


0.2 






A60 


(R)-methyl 2-(4- 
(ethylthio)benzamido)-4- 
me&ylpentanoate 


0.2 
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Table 1 - Urnami Amides 



Compound 
No. 



Compound 



Umami 
ECfio (uM) 



Ecso ratio (vs. 
MSG) 



A61 




0.22 



N-(heptan-4-yl)-4-methoxy-3- 
methylbenzamide 



A62 




0.25 



(R)-methyl2-{3,4- 
dimethylbenzamido}-3- 
mefliylbutanoate 



A63 




0.25 



(R)-methyl 2-(4-raethoxy-3- 
methylbenzamidoM-metliylpentanoate 



A64 




0.26 



4-etiioxy-3-mcthyl-N-(pentan-3- 
yl)ben2amide 



A65 




029 



(R)-N-(l-methoxy-4-methylpentan-2- 
yI)-3-metfayl-4-(methyltbio)ben2amide 
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Table 1 - Umami Amides 


Compound 
No. 


Compound 


Umami 
ECso (uM) 


Ecso ratio (vs. 
MSG) 


@ 
(uM) 


A66 


0* ^ 

N-(2,4-diinethoxybeiizyl)-3-(lH- 
pyrrol- 1 -yl)isonicotinaniide 


0.29 






A67 


V- 






0.29 


10.75 


1 




CI 

methyl 2-(3-chloio-4- 
methoxybenzamido)pentanoate 








A68 


4-etlioxy-N-(heptan-4-yl)bei]zamide 


0.32 


2.62 


0.3 


A69 








0.32 








\==J 0 \ 
(R)-metiiyl 4-metIiyl-2-{4- 
methylbenzamido)pentanoate 








A70 


F \ 
F 






0.33 








N-(heptan-4-yl)-3- 
(trifluoromethyl)benzamide 
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Compound 
No. 



Table 1 - Umami Amides 



Compound 



Umami 
ECsoCuM) 



Ecso ratio (vs. 
MSG) 



@ 
(uM) 



A71 




0.34 



4-ethyl-N"(heptan-4-yl)ben2amide 



A72 




0.34 



4-eflioxy-3-methyl-N-(5-methylhexan- 
3-yl)benzaniide 




0.34 



(R)-methyl 2-(3-methoxy-4- 
methylbenzaraido)-4-methylpentanoate 



A74 



\ 




0.35 



3-fluoro-N-(heptan-4-yl)-4- 
methoxybenzamide 




0.39 



N-(heptan-4-yl)-4- 
(methylthio)beii2amide 



4.98 



0.3 



A76 




0.4 



4-methoxy-3-methyl-N-(4- 
phenylbutan-2-yl)beii2amide 
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Table l^TJmami Amides 



Compound 
No. 



Compound 




3-chloro-4-methoxy-N-(2- 
methylcyclohexyljbenzamide 



Umami 
ECsQ (uM) 



£c5o ratio (vs. 
MSG) 



@ 
(uM) 



A78 



A79 




0.46 



N-Qieptan-4«ylM-vinylbenzamide 




0.46 



N-(heptan-4-yl)-4-nieflioxyfaeiizamide 



10.22 



03 



A80 




0.47 



3-chloro-4-nietiboxy-N-(pentan-2- 
yl)benzamide 



5.12 



0.1 



A81 




0.5 



N-(faexan-3-yl)-4-methyl-3- 
(niethylthio)benzamide 
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Table 1 - Umami Amides 



Compound 
No, 



Compound 



Umami 
ECso(uM) 



Ecso ratio (vs. 
MSG) 



® 
(uM) 



A82 




0.51 



(R)-methyI 4-metiiyl-2-(4- 
propoxybeDzamido)pentanoatg 



ASS 




0.52 



N-(heptan"4-yl)-3-mefliyIbenzaniide 



A84 




0.53 



N-(heptan-4-yl)-2-hydroxy-3- 
medioxybepzamide 



A85 




0.53 



(R)-methyl2-(3,5- 
dimethylbenzamido)-4- 
methylpentanoate 



A86 




0.53 



methyl 2-(4-methoxy-3- 
methylbenzamido)-4- 
(methyltfaio)butanoate 
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Table 1 - Umanii Amides 


Conqiound 
No. 


Compound 


Umami 
ECsd(uM) 


EC50 ratio (vs. 
MSG) 


@ 
(uM) 


A87 


o 

2-hydroxy-3-methoxy-N-(l,2,3,4- 
tetrahydroDaphfhalen-l-yl)benzaimde 


0.54 


3.8 


1 


A88 


N-(2,4-dimethylpentaii-.3-yl)-3-ixiethyl- 
4-(methylthio)bei]zaimde 


0.55 






A89 


0 ' 

(R)-3-chloro-4-methoxy-N-(l-{4- 
methoxyphenyl)ethyl)beiizaixiide 


0.6 


2.85 


1 


A90 


JO 

N-(heptan-4-yi: 


i-3-methoxybenzamide 


0.61 






A91 


(R)-methyl 4-methyl-2-(4- 
propylbenzaniido)pentanoate 


0.62 
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Table 1 - Umami Amides 



Compound 
No, 


Compound 


Umami 
ECso (nM) 


Ecso ratio (vs. 
MSG) 


@ 
(uM) 


A92 


i 

4-etiioxy-3-methyl-N-(2-methyllieptan- 
4-yl)benzanii(ie 


0.65 






A93 


(S)-2-hydroxy.3-raethoxy-N-(l,2,3,4- 
tetrahydionaphthalen-l-yl)benzamide 


0.7 


5.7 


1 


A94 


(RH-methoxy-N-(2-methoxy-l- 
phenylethyl)-3-methylbenzamide 


0.72 






A95 


(R)-methyl 2-(4.mettidxy-3,5- 
dimethylbenzamido)-4» 
methylpentanoate 


0.74 
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Table 1 - Umami Amides 


Compound 
No. 


Compound 


Umami 
ECso (uM) 


Ecso ratio (vs. 
MSG) 


@ 
fuM) 














A96 




0 L 

4-inethoxy-N-{l-(4- 
inethoxyphenyl)propyl)-3- 
methylbenzamide 


0,76 


















A97 






0.85 








4-mcthoxy-N-(l-me11i0xypentan-2-yl)- 
3-methyIbenzamide 




















A98 


V 


V=/ 0 

a 

3-chloro-N-(l-hydioxy-4~ 
metliylpezitan-2-yl)-4- 
methoxybenzamide 


0.88 






A99 






0.89 








(R)-methyl 4-mefliyl-2-(3- 
methyIbenzamido)pentanoate 
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Table 1 - Umami Amides 


Compound 
No. 


Compound 


Umami 
EC„(uM) 


Ecso ratio (vs. 
MSG) 


@ 
(uM) 


AlOO 


0 1 

3-chloro-4-methoxy-N-(l-p- 
tolylethyDbenzamide 


1.1 






AlOl 


N-(heptan-4-yl)-2-hydroxy-4- 
methoxybenzamide 


1.16 


7.62 


1 


Ai02 


4-hydroxy-3-methyI-N-{l,2,3,4- 
tetrahydronaphtlialen-l-yl)benzamide 


IJ2 


9.49 


1 


A103 


(lS,2R)-e%l 2-(3-cliloro-4- 
mBdio:^ei]zaimdo) 
cyclohexanecaiboxyhte 


1.36 
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Table 1 - Umami Amides 


Compound 
No. 


Compound 


Umami 
ECso (uM) 


Ecso ratio (vs. 
MSG) 


@ 
(uM) 












A104 


Biphenyl-2-carboxylic acid 2,4- 
dimethoxy-benzylamide 








A105 


(S)-N-{l,2,3,4-tetraliydronaphthalen-l- 
yl)-4-vmyIbenzamide 


1.38 


2.79 


1 


A106 


J \J 

3-chloro-N-{2,3-dihydro-lH-mden-l- 
yl)-4-meihoxybenzaniide 


1.39 


4.01 


0.3 
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Table 2 - Umami Ozalamides 


Compound 
No. 


Compound 


Umami ECso 
fuM) 


Ecso ratio (vs, 
MSG) 


Bl 


Nl-{2,4-4imethoxybenzyl)-N2-(2-(iura3i-2- 
yl)e11iyl)oxalamide 


0.18 




B2 


^0 0 

Nl-(4-ethoxy-2-ineflioxybenzyl)-N2-(2-(5- 
nietliyIpyridm-2-yl)ethvl)oxalamide 


0.19 




B3 


N-<3-Methyl-benzo[b]tinophen-2-ybnethyl)-N'-(2. 
pyridm-2-yl-ethyl)-oxalamide 


0.81 




B4 


oH 

0 0 

Nl-(2-isopropoxybenzyl)-N2-(2-(pyridm-2- 
yl)etliyl)oxalamide 


1.22 
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Table 3 -Umami Ureas 


Compound 
No. 


lUJrAC IName 


Uinami 

EC50 
uM 


Ec50 
ratio (vs. 
MSG) 


Con. 
(iiM) 


CI 


CI ^ V 

0 \ 

l-(2-chlorophenyl)-3-(heptan-4-vl>urea 


0.37 


4.95 


1 


C2 


CI HN— ^ ^ 

0 

l-(2,4-dichlorophenyl)-3-(l-phenylpropyl)urea 


0.49 


4.52 


1 


C3 


CI ^ 

l-(2-chlorophenyl)-3-(2-methylcyclohexyl)urea 


0.S2 


3.24 


3 


C4 


F HN— ^ > y 

l-f2-fluorophenyl)-3-flieptan-4-yl)urea 


0.79 


12.15 


3 


C5 


ci-^ O 

l-(2-chlorophenyl)-3-(l-cyclohexyIeliiyl)urea 


0.84 


9.08 


1 


C6 


0 

1 loupiupyipiicnyi^O'^^-^^pynajn-z-yijeinyijiirea 


0.98 






C7 


l-(2-chlorophenyl)-3-(l,2,3,4-tetrahydronaphflialeii- 
l-yl)urea 


0.99 


3.68 


1 , 
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Table 3- 


Umami Ureas 


Compound 
No. 


lUPAC Name 


Umami 
uM 


£c50 
ratio (vs. 
MSG) 


Con. 
(uM) 


C8 


l-(2,4-dimefhoxyphenyl)-3-(2- 
methylcyclohexyl)urea 


1.41 


2.62 


0.3 


C9 


o 

\ 

1 -(2-etiiylphenyl)-3-(heptan-4-yl)urea 


1.42 






CIO 


O 

/° \ /~NH y — ^ 

l-C4-etlioxvDhenvlV3-r2-methvlcvclohexvl^uTea 


1.51 


2.1 


0.3 


Cll 


l-(2-£Iuorophenyl)-3-(l,2,3,4-tetrahydronaphihalen- 
l-yl)urea 


1.65 


4.49 


1 


C12 


— 0 HN— > ' 

O 

l-(2-meflK>xyphenyl)-3-(2-methylcyclohexyl)urea 


1.67 






C13 


l-(2,4-dimethoxyphenyl)-3-(pentazi-3-yI)iirea 


1.72 


11.87 


1 
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Table 4 - Umami Acrvlamides 


Compound 
No. 


Compound 


Umami ECso 
(uM) 


Ecso ratio 
(vs. MSG) 


@ 
(uM) 


Dl 


(E)-N-(2,4-dimethylpentan-3-yl>^ 
niethoxypheiiyOacrylamide . 






1 
1 


D2 


(R,E)-methyl 2-(3-(4-metiioxyphenyl) 
acrvlaimdo)-4-methylpentanoate 


0.32 






D3 


.(E)-methyl 2-(3-(4-methoxyphenyl) 
acrylaniido)hexanoate 


0.63 






D4 


N-(l-Methyl-3-phenyl-propyl)-3- 
thiophen-2-yl-acrylamide 


0.69 


9.73 


1 


D5 


(E)-N-(heptan-4-yl)-3-(4 
methcxyplienyOacrylamii 


— 

ie 


0.72 


3.48 


0.3 


D6 


N-(l-Propyl-butyl)-3-thiophen-2-yl- 
acrylamide 


0.75 


6.3 


1 


D7 


0 

(B)-3-(4-methoxyphenyl)-N- . 
(pentan-S-yDaciidaoiide 


0.82 


9.62 


1 
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Table 4 - Umami Acrylamides 


Compound 
No. 


Compound 


Umami EC50 
(uM) 


Ecso ratio 
(vs. MSG) 


@ 
fuM) 


D8 


(R,E)-3-(4-€tiioxyphen^ 
me11ioxy-4-methylpei 
yl)acrylaniide 


yl)-N-(] 
atan-2- 




0.94 






D9 


^^^^^ \ 

(Z)-N-(heptan-4-yI)hex-2-enainide 


0.98 






DIO 


0 

(R^)-methyi 4-methyl-2-{3-<thiop] 
3-yl)acrylaimdo)pentaiK)ate 


=0 
bsa- 


1.09 






Dll 


(R)-metfayl 2-cinnainamido-4- 
methylpentanoate 


1.17 






D12 


(E)-4«inethyl-N-(2-methylcyclohexyl) 
pent-2-enamide 


1.28 






D13 


0 

(E)-N-sec-butyW-(4- 
etfaoxyphenyDacrylamide 




1.31 


2.7 


0.3 
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Table 4 - Umami Acrylamides 


Compound 
No, 


Compound 


Umami EC50 


EC50 ratio 


@ 

iUiVXJ 












D14 


(E)-N-(l-methoxybutan.2-yl).3-(4« 
metfaoxyphenyl)acryIan]ide 


1.43 


8.48 


1 


D15 


(E)-N-(heptan-4-yl)-3- 
( tliioDhen-3 -vl^acrvlamide 


1.54 


2.22 


0.3 


D16 


(E)-3-(3,4-dimethoxyphenyl)-N-(4- 
phenylbutan-2-yl)acrylamide 


1.56 


3.13 


1 
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Table 5 - Sweet EnhancerAmides 


Compound 
No. 


Compound 


Sweet 
ECso 
oM 


Umami 
iiM 


Umami 
EC50 
ratio 












El 


OH 

CI 

3-chloro-2-hydroxy-N-(2.methyl-l,2,3,4« 
tetrahydronaphthalen- 1 -yObenzamide 


0.19 
















E2 


cx'/o 

OH 

CI 

^)-3-chloro.2-hydioxy-N-(l,2,3,4- 
tetrahydronaphfhalen-l-yObenzaimde 


0.65 








\=/ HN-/ ) 








E3 


CI OH /=( 

3-chloro-2-hydroxy-N-(5-hydroxy-l,2,3,4- 
tetrahydronaphthaleu-l-yObenzaimde 


1.03 








/ \ 


















E4 


CI OH 

3-chloro-2-hydroxy-N-(4-inethyl-l,2,3,4- 
tetrahy dronaphthalen- 1 -y^benzamide 


1.61 
















E5 


3-chloro-2-hydroxy-N-(6-methoxy-l,2,3,4- 
tetrahydronaphdialen-l-yl)benzaimde 


1.61 







39 



wo 2005/015158 



PCTAJS2004/025459 



Table 5 - Sweet £nhancerAinides 


Compound 
No, 


Compound 


Sweet 
ECso 
uM 


Umaml 
EC50 
uM 


Umami 
£Cso 
ratio 


£6 






1.48 








3-methyl-N-<2-methyl-l,2,3,4-tetrahydrQn^hthaleii- 
1 -yl)isoxazole-4-carbcBcaniide 








E7 


O f^^^ 

CI 

3-cliloro-2-hydroxy-N-(l,2,3,4- 
tetrahydronaphtlialen-1 -yl)benzamide 


1.81 




4.04 














£8 


OH 

2,3-dfliydro^-N-(2-inefliyl-U,3,4- 
tetialiydronaiditlialen-I-yl)benzaimde 


1.98 








0 ^^'1^'^ 










£9 


! I " T 1 

OH 

2-hydioxy-N-(2-methyl-l,2,3,4-tetraliydronaphthalen- 
l-^l)benzamide 


2.36 






£10 


OH 

2,3Kiihydroxy-N-(5-methoxy-l,2,3,4- 
tetrahydronaphtfaalen-l-yl)benzamide 


2.44 







I 
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Table 5 - Sweet EnhancerAmides 



Compound 
No. 



Compound 



Sweet 
EC50 
uM 



Umami 
EC50 
uM 



Umami 
EC50 
ratio 



Bll 




2.46 



3-methyl-N-(4-methyl-l,2,3,4-tetrahydronaphthalen- 
. l"yl)isoxazole^carboxamide 



E12 




2.85 



N-(5-methoxy- 1 ,2,3 ,4-.tetrahydronaplithalen-l-yl)-3- 
methylisoxa2ole-4*carboxamide 



E13 




2.91 



(S)-3-chloro-2-methyl-N-{l,2,3,4. 
tetrahydronaphthalen-l-yObenzamide 



£14 




(S)-2,6-dimethyl-N-(l,2,3,4- 
tetrahydronaphthalen- 1 -yl)benzaimde 



2.91 



E15 




3.02 



2,6-dichloro-N-(l,2,3,4- 
tetrahydronaphthalen-l-yl)benzaniide 



E16 




3,6-dichlpro-2-meliioxy-N-(l,2,3,4- 
tetrahydronaphtfaalen-I-yDbenzamide 



3.04 
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Table 5 - Sweet EnhancerAmides 


Compound 
No. 


Coiiq>ound 


Sweet 
uM 


Umami 

£^50 


Umami 
ECso 
ratio 


E17 


OH 

(R)-23-dihydroxy-N-(l,2,3,4-tetrahydron^hthalen- 
l-yl)benzamide 


3.13 






E18 


0 1 1 

ll if 1^ ll 

2,5-dihydroxy-N-(5-metiioxy- 1,2^,4- 
tetrahydranaph1]ialen-l-yl)beiizatnide 


3.38 






E19 


F 

{S)-3-fluoro-2-methyl-N-(l,2^,4- 
tetiahyclronaphfhaIeii-l-yl)beiizainide 


3.57 






E20 


(S)-3-chloro.2,6-dimethoxy-N-(l ,2,3,4- 
tetrahydronaphthalen- 1 -yObenzamide 


4.13 






E21 


■ X CI 

(R)-5-bromo-N-(l A3,4-tetrahydroiiaphthalen-l- . 
yl)mcotiiiamide 


4.19 
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Table 5 - Sweet EnhancerAmides 


Confound 
No. 


Compound 


Sweet 
ECso 
uM 


Umami 
ECso 
uM 


Umami 
EC50 
ratio 












E22 


CI 

(R)-3-chloro-N-(l,2,3,4- 
tetrahydronaphthalen- 1 -yl)benzaniide 


4.52 








0 Pi 








E23 


F 

(R)-3-fluoro-N-(l,2,3,4-tetraliydronaphthalen-l- 
yl)benzamide 


4.86 






E24 


0 r^'"'^ 

II 

(R)-2.5-dihydroxy-N-(l,2,3,4. 
tetiahydn>naphflialen-l-yl)beiizaiDide 


6.04 
















,E25 


O ■ O 

(R)-3-methyl-N-(l,2,3,4-tetrahydronaphtiialen-l- 
yl)isoxazole-4-C2irboxaniide 


7.79 
















E26 


{R)-5-methyl-N-(lA3,4-tetrahydron^lithaleii- 
1 -yl)isoxazole-4-caxboxaimde 


8.09 






E27 


F 0 ' 
2,3,5,6-tetrafluoro-4-methyl-N-(3-inethyIbulaii-2- 
yl)beiizanude 


0.14 
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Table 5 - Sweet Enhancer Amides 


Compound 
No. 


Compomid 


Sweet 
£Cso 
uM 


Umami 
ECso 
uM 


Umami 
ECso 
ratio 






















£28 


F 0 1 

N-(3,3-dimethylbiitan-2-yl)-2,3,5,6-tetrafluoro4- 
methylbenzamide 


0^1 


























E29 


N-(2-methylcycloliexyl)-3- 
(triihioiomethoxy)bei]zamide 


0.42 








p 








E30 


3-cbloto-5-fIuoro-N-(2-mefhylcycIohexyl)benzamide 


0.45 






E31 


F O ■ 

(R)-N-(3,3-dimethylbutan-2-yl)-2,3,5,6-tetrafluoro-4- 
methylbenzamide 


0.49 








1 








E32 


4-fluoro-N-(2-niethylcyclohexyl)-3- 
(tri£luoroinethyl)bepzaimd6 


0.51 
















E33 


CI 

2,5-dichloro-N-(2-methylcycloliexyl)benzamide 


0.63 
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Table 5 - Sweet EnhancerAmides 


Conqiound 
No. 


Compound 


Sweet 
uM 


Umaml 
uM 


Umaml 
EC50 
ratio 














E34 






0.71 








F 0 ' 

2,3,5,6-tetrafluoro-N-(hexaii-2-yl)-4- 
methylbenzamide 












1 1 








E35 




0,71 










^0 0 k, ) 










3,5-dicliloro-2,6-dimethoxy-N-(2- 
methylcyclohexyDbenzamide 








E36 




XX - 1 

o 


0.72 








2A6-ti±ne±yl-N-^2-methylcyclohexyl)benzaimde 








B37 


CI 


^0 0 


0.77 








3,6-dicliloTO-2-n]etiioxy-N- 
(2-methylcycloliexyl)beiizaimde 








E38 




•p 


0.9 








F O ^ 

(S)-N-(33-dimethylbutan-2-yl>2,3,5,6-tetnifluo^^ 
methylbenzamide 








£39 




CI 0 LJ 


0.91 








■ 2,6-dicUoro-N-(2-ii]et]iylcycIobexyl)beDzaxiiide 
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Table 5 - Sweet Enhancer Amides 


Compound 
No. 


Compound 


Sweet 
ECso 
uM 


Umami 
uM 


Umami 
ECso 
ratio 


E40 


2-chloro-6-methoxy-N-(2- 
methylcyclohexyOisosicotmamide 


0.95 




9.77 


E41 


\ / 

N-((2R)-bicyclo[2.2.1]heptan-2-yl)-2,3,5,6- 
tetrafluoro>4-methylbenzaii]i(le 


1.02 






£42 


N-(l-methc 


xyh 


utaii-2-yl)-2,4-dimetiiylbenzaimde 


1.06 






E43 


F 

^^^^^^^^^ ^X^^^^^ 

N-(23-dimefiiylcyclohexyl)-2,3,5,6-tetrafluQro 
-4-methylbeDzamide 


1.08 






E44 


y— NH / 
2-cMor()-N-(2,3-dimethylcyclohexyl)isoiiicotinainide 


1.08 






E4S 


N-<:ycIohexyl-2.3,5.6-tetrafluoro-4-inethyIbenzaimde 


1.13 
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Table 5 - Sweet EnhancerAmides 



Compound 
No. 



Compound 



Sweet 
SCso 
uM 



Umami 
ECso 
uM 



Umami 
£C5o 
ratio 



E46 




1.25 



F F 

N-cyclooctyl-2,3,5,6-tetrafluoro-4-mefliylbenzamide 



E47 




1.25 



(R)-2,3,S,6-tBtrafIuoro-4-methyl-N-(3-methyIbutan-2- 

- yl)benzainide 

^cr ^ 



E48 




1.29 



3,6-dichloro-N-(2,3-dimethylcyclohexyl)-2- 
methoxybenzamide 



£49 




1.39 



N-cycloheptyl"2,4,6-trimethylbenzamide 



E50 




1.41 



N-(2,3-dimethylcyclohexyl)-2,4,6- 
trime&^Ibenzamide 



E51 




1.49 



3-chloro-N-(2,3-dihydro-lH-mden-l-yl)-2- 
hydroxybenzamide 
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Table 5 - Sweet EntaancerAmldes 


Compound 
No. 


Compound 


Sweet 
EC50 
uM 


Umami 
ECso 
uM 


Umami 
ratio 












E52 


HN— ( ) 
2-metfayl-N'<2-methylcyclohexyI)-l-naphfhaimd€ 


1.52 








fir 








E53 


3-cUoro4-fluoro*N-(2-met]^lcyclo]iexyl)benzainide 


1.7 








O 








E54 


3,4-dichloro-N-(2-methylcyclohexyl)benzainide 


1.83 




10.66 












E55 


5-bromo-N--(2,3-dimeAyIcyclohexyl)nicotiiiaiiiide 


1.89 
















ES6 


2-cMoro-N-<2-methylcyclohexyl)isoiiicoti]iaimde 


1.92 




2.08 


E57 


CI 0 

2-chloroO-niethyl-N-(2-methylcyclohexyl)benzamicie 


1.95 
















ESS 


F F 

N-cyclopentyI-2,3,5,6-tetra£luoio-4-methylbeiizaniide 


2.23 
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Table 5 - Sweet Enhancer Amides 


Compound 
No. 


Compound 


Sweet 
EC50 
uM 


Umami 
ECso 
uM 


Umami 
EC50 
ratio 


E59 


^^^^^^^^^^^^ 

N-(2-methylcyclohexyl)-3- 
(trifluoroinethyDbeiizaiiiide 


234 




2.07 


E60 


Fv H 

4-£luoro-N-(4-methylcyclohexyl)-3- 
■ ftrifluoTOinethvnbeiizaniide 


2.37 






E61 


2-fluoro-N-(2-iiiethylcycIoliexyl)-3- 
(tri£luoromefhyl)beiizamide 


2.4 






£62 


5-brQmo-'N-(2-mefhylcyclohexyl)mcotiiiamide 


2.42 






E63 


23--dimethyl-N-(2-methyIcyclohexyl)benz^ 


2.6 






E64 


^^^^^^ "^Cx^ 

2,6-dicMoro-N-(2,3-4iinethylcyclohexyl)benzaimde 


2.77 






E65 . 


O 

-oo 

2-fluoro-N-<2-methylcyclohexyl)isomcotixianiide 


2.83 
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Table 5 - Sweet Enhancer Amides 


Compound 
No. 


Compound 


Sweet 
uM 


Umami 
ECso 
uM 


Umami 
ECso 
ratio 


E66 


N-cvclohexvl-2 4 6-1ninethvIbenzaTTiide 


2.86 






E67 


OH 

2-hydroxy-4-methyl-N-(4- 
methylcycIohexyl)benzamide 


2.98 






E68 


F. 

N-Cheptan-^ 


Ok 

c 


r"xr 

nbromethyObenzainide 


3.03 


0.33 




E69 


F 0 

2,3,5,6-tetrafluoro-N-isobutyl-4-methylbenzamide 


3.19 






E70 


P 

F O 1 
2,3,5,6-tetrafluoio-4-iiiethyl-N-(5-methylhexan-2- 
yl)benzaniide 


3.2 






E71 


N-(2-methylcyclohexyl)benzo[c][l ,2»5]oxadiazoIe-S- 
carboxamide 


3.33 






E72 


2-hydroxy«3-methoxy-N-(4- 
methylcyclohexyDbenzamide 


3.35 
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Table 5 - Sweet Enhancer Amides 



Compound 
No. 



Compound 



Sweet 
EC50 
uM 



Umami 
EC50 
uM 



Umami 
ECso 
ratio 



E73 




3.36 



Thiopheiie-2-caiboxylic acid (1,3,3-trimethyl- 
bicy;lo[2.2. l]hept-2-yl>-amide 



E74 




3.62 



N-{2,3-dmietliylcyclohexyl)-2- 
(perfluorophenyl)acetaniide 



E75 




3.78 



CI O 

2,3-dichloro-N-(pentan-3"yl)benzamide 



E76 




3.99 



23-<iiclilorO"N-(2,3-dimethylcyclohexyl)beiizamidc 



E77 




4.11 



F O 

N-(2,3-dimetfaylcycIohexyl)-2,5~difluorobeD2amide 



E78 




4.24 



8.51 



4,5-Dichloro-isothiazole-3-carboxylic acid (2-methyl- 
cyclohexyl)-aniide 
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Table 5 - Sweet EnhancerAmides 


Compound 
No. 


Compound 


Sweet 
EC50 

UlTX 


Umami 
ECso 
nivi 


Umami 
ECso 
ratio 


E79 


OH 0 

N-(2,4-aimetnylpentaiL'-3-yl)-2,o- 
dihydrox^enzamide 


4.28 






E80 


3-chloro-2-methyl-N-(2-mefliylcyclohexyl)benzamide 


4.29 






E81 


O 

3,4-difluoro-N-(2-methylcyclohexyl)benzan]ide 


.4,37 




6.98- 


£82 


3,5>-dimethyl-N-(2-methylcyclohexyl)benzamide 


4.48 






E83 


N-(4-ethoxyphenethyl)- 1 -raethyl-lH-pyrazole-5- 
carboxamide 


4.68 






E84 


0 

3,6-dichloro-N-(2-fluorophenyl)-2- 
methoxybenzamide 


0.83 




16.51 
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Table 5 - Sweet Enhancer Amides 



Compound 
No. 



Compound 



Sweet 
£Cso 
uM 



Umami 
SiCso 
uM 



Umami 
ECso 
ratio 



E85 




N-(2-Chloro-4,6-dimetiioxy-phenyl)-3- 
trifluorometiiyl-benzamide 



1.42 



£86 




3,5-dichloro-N-(2,4-dimethyIplienyl)-4- 
meflioxybenzamide 



1.48 



E87 




3-Chloro-4-fluoro-N-(5-trifluoromethyl- 
[1 ,3»4]thiadiazol-2-yl>'ben2amide 



1.55 



ESS 




3,5'dichlorQ-4-metfaoxy"N-o-tolylbeii2amide 



1.84 



£89 




5-Chloro-N"(2,4-difluoro-phenyl)-2-hydroxy- 
benzamide 



2.56 



£90 




2.71 



2,4-Dichloro-N"(2-cyano-3-fluoro-plienyl)-benzannde 
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Table 5 - Sweet Enhancer Amides 


Compound 
No. 


Compound 


Sweet 
EC50 
uM 


Umami 
EC50 
uM 


Umaml 

ECso 
ratio 




CI 0 iT^ 










E91 














2,6-Diclil(»o-N-(4-cyaiio-plieay])-benzaiiiide 










0 










E92 


4-chloro-N-(2,4-dime1iiylphenyl)-3-meAy 


Ibenzamide 


2 74 










—O 








E93 


01 

3,5-dichloro-4-methoxy-N-(4- 
niethoxyphe^l)benzamide 




3.24 


















E94 






3.56 








3-chloro-N-(2,4-diinethoxyphenyl)-4- 
fluorobenzamide 












! 








E95 


5-Cyano-2,4-dimefliyl-6-methylsulfenyI-N-phenyl- 
nicotinamide 


3.58 


















E96 






3.73 








N-(4-tert-Butyl-thiazol-2-yl)-isonicotmainide 
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Table S • Sweet EnhancerAmides 


Compound 
No. 


Compound 


Sweet 
ECsD 
uM 


Umami 
BiCso 
uM 


Umami 
ratio 


E97 


3,6-DicMoro-N-<2,4-dimethyl-phenyl)-2-inethox 
benzamide 








E98 


N-(3-ethy]phenyl)-2-methoxy-6-ir 


lethylbenzamide 


4.63 






E99 


N-(4-bronM)-2,6-dimethylphenyl)isoindoline-2- 
carboxamide 


053 






ElOO 


N-(2-niethyl-4-mtFoplieiiyl)isoindolme-2- 
carboxamide 


13 






ElOl 


N-(2,4-difluoropIienyI)isoindolme-2-carboxaimde 


137 






E102 


N-(2-methyl-3-nitrophenyl)isoindolme-2- 
carboxaimde 


2.01 






E103 


N-(2,3.4-trifluoropheayl)isoindoliiie-2-cai1]oxamid6 


2.58 
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Table 5 - Sweet EnhancerAmides 


Compound 
No. 


Compound 


Sweet 
EC50 
uM 


Umami 
ECso 
uM 


Umami 
EC50 
ratio 




N-p-tolylisoindoIine-l-carboxamide 








E105 


N-(4-cblorop]ienyl)isomdoline-2-caiboxamide 


3,4 






E106 


N-(2-chlorophenyl)isoindoline-2-caiboxamide 


3.85 






E107 


N-(2,4-^chlorophenyl)isomdoline-2-caiboxamide 


4.15 






E108 


N-(4-methoxyphenyl)isoindoline-2-'Carboxamide 


4.99 






E109 


^ / 0 

N-<2,4-^cUorophenyl)-3,4-dihydroisoquinoliiie- 

2(lH)-carboxamide 


2.34 






EllO 


\ / 0 

N-(2-cyanopnenyl)-3,4-dmydroisoqumoline-2(lH)- 
carboxamide 


2.5 






Elll 


N-p-tolyl-3,4-dihydroisoquinoline-2(lH)- 
carboxamide 


4.27 
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Table 5 - Sweet EnhancerAmides 




Compound 
No. 


Compound 


Sweet 
ECso 
uM 


Umami 
uM 


Umflmi 
ECso 
ratio 


El 12 


N-(3-chloro-2-mefliylpheiiyl)-3,4- 
dihydroisoqumoIin6-2(lH)-carboxanude 


433 






BU3 


N-(2,4-dimethoxyphenyl>3,4-(3jhy<koisoquinoline- 
2(lH)-carboxanude 


4.44 
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Also, as supported by experimental data in the examples, it has been shown that 
cell lines which co-express T1R1/T1R3 or T1R2/T1R3 respectively respond to umami 
or sweet taste stimuli and a quantitative dose-responsive maimer which further supports 
a conclusion that specific binding to the T1R1/T1R3 and T1R2/T1R3 receptor can be 
5 used to define receptor agonists and antagonists, e.g., MSG substitutes, umami 
blockers, novel artificial and natural sweeteners, and sweet blockers. 

Also, as supported by data in experimental examples, it has been shown that the 
sweet taste blocker lactisole inhibits both the T1R2/T1R3 sweet receptor and the 
T1R1/T1R3 umami taste receptor. Compounds are provided herein that enhance, 

1 0 mimic, modulate or block sweet or umami taste. The fact that lactisole inhibits both the 
T1R1/T1R3 and T1R2/T1R3 receptors suggests that these receptors may share a 
common subunit which is bound by lactisole and potentially other taste modulators.. 
Therefore, this shows that some compounds which enhance, mimic, modulate or block 
sweet taste can have a similar effect on umami taste or vice versa. 

1 5 Further, as supported by data in experimental examples, it has been 

demonstrated that cell lines which stably co-express TlRs, i.e. T1R1/T1R3 or 
T1R2/T1R3, when assayed by automated fluorescence imaging very effectively 
respond to various sweet and umami taste stimuli, i.e. at magnitudes substantially 
greater than transiently transfected cells. Thus, these cell lines are especially well 

20 suited for use in high throughput screening assays for identifying compounds that 

modulate, block, mimic or enhance sweet or umami taste. However, the mvention also 
encompasses assays that utilize cells that transiently express a TIR or combination 
thereof 

Moreover, while the application contains data demonstrating that some TlRs 
25 act in combination, particularly T1R1/T1R3 and T1R2/T1R3, and tiiat such receptor 
combinations may be used in assays, preferably high throughput assays, it should be 
noted fliat the subject invention also encompasses assays that utilize TlRl, T1R2 and 
T1R3 alone or in combination with oflier proteins, e.g., other GPCRs. 

There are differences in human and rodmt sweet taste m terms of the ligand 
30 specificity, G protein coupling efficiency, as well as sensitivity to inhibitors. The 
species dijBFerences in TIR ligand specificity can be utilized to demonstrate that the 
sweet taste receptor indeed functions as a heteromeric complex, and that thei^ is more 
than one ligand binding site on the receptor. Furthermore, a functional Unk between 
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the sweet aad umami receptors mediated by T1R3 has been shown OBxample 16). 

Both human and rat sweet receptors can efficiently couple to a chimeric Gal S 
with the C-tenninal tail sequence from Gaii (Gais/u). For example, human but not rat 
T1R2/T1R3 selectively responds to a group of sweeteners, including aspartame, 
5 neotame, and cyclamate. This is consistent with taste physiology data. These 
differences in agonist specificity can be utilized to map their binding sites on the 
receptor. A chimeric TIR can be generated between human and rat genes, with a 
junction immediately before the transmembrane domain. Each TIR chimera therefore 
consists of two halves, the N-terminal extracellular domain, and the C-terminal 

10 transmembrane and intracellular domain, from diffeent species. For example, a 

chimeric T1R2, termed T1R2-R, has a sequence from the N-terminus of human T1R2 
frised to rat T1R2 C-teminal sequence. Responses to these chimeras can then be 
tested (Figure 22). 

Novel compounds and novel flavor, tastants, and sweet enhancers were 

15 discovered in the chemistry series of amide derivatives. The amide compounds also 
comprise certain sub-classes of amide derivatives or classes of derivatives related to 
amides, such as for example ureas, urethanes, oxalamides, acrylamides, and the like. 
These compounds, when used together with sucrose or alone, increase a response in 
vitro and concomitant increase in sweet perception in human testing. These 

20 compounds enhance other natural and synthetic sweet tastants. Examples of these 
compounds are listed in Table 5. 

In one embodiment, the invention provides novel compounds, flavorants, 
tastants, flavor enhancers, taste enhancers, flavor modifying compounds, and/or 
compositions containing them. 

25 In a more specific embodiment, the invention provides novel sweet flavorants, 

sweet tastants, sweet taste enhancers, and sweet taste modifiers and compositions 
containing them. 

More particularly, in another embodiment, the invention is directed to 
compounds that modulate, induce, enhance, or inhibit natural or synthetic sweet 
30 tastants, eg., naturally occurring and synflietic sweeteners. 

In another embodiment, the invention provides compositions, preferably 
compositions suitable for human or animal consumption, containmg at least one 
compoimd of the invention. These compositions include foods, beverages and 
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medicinals, and food additives which when added to foods, beverages or medicinals 
modulate the flavor or taste thereof, particularly by enhancing tiie sweet taste thereof. 

Another embodiment of the invention is directed to use of a compound of the 
invention to modulate the sweet taste of a desired food, beverage or medicinal, which 
5 composition may comprise one or more other compounds that elicit a sweet taste.These 
compounds, when they were used together with naturally occuning and synthetic 
sweeteners, not only increased a response in }>itro but also intensified the sweet and 
other flavor or taste perceptions in human tasting. These specific compounds, when 
they were used together with sweet tastants, such as naturally occurring and synthetic 

10 sweeteners, not only increased the T1R2/T1R3 response in vitro but also intensified flie 
sweet taste and other flavor or taste perceptions in human tasting. 

Novel confounds and novel flavor, tastant, and umami enhancers and tastants 
such as amides, ureas, amino-amides, amido-amides, and P-lactams are also disclosed 
herein. These compounds, when used together with MSG or alone, increase a response 

IS in vitro and the umami perception in human tasting. These compounds also enhance 
other natural and synthetic umami tastants. Examples of these compounds are listed in 
Tables M. 

In one embodiment, the invention provides novel compounds, flavorants, 
tastants, flavor enhancers, taste enhancers, flavor modifying compoxmds, and/or 
20 compositions containing them. 

In a more specific embodiment, the invention provides novel umami flavorants, 
umami tastants, umami taste enhancers, and umami taste modifiers and compositions 
containing them. 

More particularly, in another embodiment, the invention is durected to 
25 compounds that modulate (induce, enhance or inhibit) natural or synthetic umami 
tastants, e.g., monosodium glutamate (MSG). 

In another embodiment, the invention provides compositions, preferably 
compositions suitable for human or animal consumption, containing at least one 
compound of the invention. These compositions include foods, beverages and 
30 medicinals, and food additives which when added to foods, beverages or medicinals 
modulate the flavor or taste thereof particularly by enhancing the umami taste thereof. 

Another embodiment of the invention is directed to use of a compound of tiie 
invention to modulate the umami taste of a desired food, beverage or medicinal, which 
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composition may comprise one or more other compounds that elicit a umami taste, 
e,g., MSG. These compounds, when they were used together wifli MSG, not only 
increased a response in vitro but also intensified the umami and other flavor or taste 
perceptions in human tasting. These specific compounds, whm they were used together 

5 with umami tastants, such as MSG, not only increased the TlRi/TlR3 response in vitro 
but also intensified the umami taste and other flavor or taste percq)tions m human 
tasting. Some of the compounds, when they were tasted alone, elicited human 
perception of umami. 

Compounds deJSned by specific binding to specific receptors using the 

10 present TIR assays can be used to modulate the taste of foods and beverages. 
Suitable assays described in fiirther detail infra include by way of example whole- 
cell assays and biochemical assays, including direct-binding assays using one of a 
combination of different TIR recq)tors, chimeras or fragments thereoJ^ especially 
Augments containing N-terminal ligand-binding domains. Examples of assays 

15 appropriate for use in the invention are described in greater detail infira and are 

known in the GPCR field, 

Assays can be designed that quantitate the binding of different compounds or 
mixtures of compounds to TIR taste receptors or TIR taste receptor combinations or 
TIR receptors expressed in combination with other heterologous (non-TlR) proteins, 

20 e.g. other GPCRs, or that quantitate the activation of cells that express TIR taste 
receptors. This can be effected by stably or transiently expressing taste receptors in 
heterologous cells such as HEK-293,CHO and COS cells, Thus, this physico- 
chemical characteristic of the compounds is used to define a genus of compound that 
share this characteristic. 

25 The assays will preferably use cells that also express (preferably stably) a G 

protein such as Gal 5 or Gal6 or other promiscuous G proteins or G protem variants, or 
an endogenous G protein. In addition, Gp and Gy proteins may also be expressed 
therein. 

The effect of a compound on sweet or umami taste using cells or compositions 
30 that express or contain the above-identified receptors or receptor combinations may be 
determined by various means mcludmg the use of calcium-sensitive dyes, voltage- 
sensitive dyes, cAMP assays, direct binding assays using fluorescentiy labeled ligands 
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or radioactive Ugands such as ^H-glutamate/or transcriptional 
reporter such as luciferase or beta-lactamase). 

Assays that may be utilized with one or more TlRs according to the invention 
include by way of example, assays that utilize a genetic selection for living cells; assays 

5 that utilize whole cells or membrane fragments or purified TIR proteins; assays that 
utilize second messengers such as cAMP and IP3, assays that detect the translocation of 
airestin to the cell surface, assays that detect the loss of receptor expression on the cell 
surface (internalization) by tested ligands, direct ligand-binding assays, competitive- 
binding assays with inhibitors, assays usiag in vitro translated protein, assays that 

10 detect conformational changes upon the binding of a ligand (e.g., as evidenced by 
proteolysis, fluorescence, or NMR), behavioral assays that utilize transgenic non- 
human animals that express a TIR or TIR combination, such as flies, worms, or mice, 
assays that utilize cells infected with recombinant viruses that contain TIR genes. 

Also within the scope of the invention are structure-based analyses wherein the 

15 X-ray crystal stracture of a TIR or TIR fragment (or combination of TlRs, or a 
combination of a TIR with another protein) is determined and utilized to predict by 
molecular modehng techniques compounds that will bind to and/or enhance, mimic, 
block or modulate the particular TIR receptor or receptor combination- More 
particularly, the invention embraces the determination of the crystal structure of 

20 TlRinriRS (preferably hTlRl/hTlR3) and/or T1R2/T1R3 (preferably hTlR2/hTlR3) 
and the use of such crystal structures in structure-based design methods to identify 
molecules that modulate TIR recqptor activity. 

The invention especially includes biochemical assays conducted using cells, 
e.g., mammalian, yeast, insect or other heterologous cells that express one or more full 

25 length TIR receptors or fragments, preferably N-terminal domains of TlRl, T1R2 
and/or T1R3 . The effect of a compound in such assays can be determined using 
competitive binding assays, e.g., using radioactive glutamate or IMP, fluorescence 
(e.g., fluorescence polarization, FRET), or GTP7 ^^S binding assays. As noted, in a 
preferred embodiment, such assays will utilize cell lines that stably co-express 

30 T1R1/T1R3 or T1R2/T1R3 and a suitable G protein, such as Gais- Other appropriate G 
proteins include the chimeric and variant G proteins disclosed in U.S- Application 
Serial No. 09/984,292 and 60/243,770, incorporated by reference in their enturety 
herein. 
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Still further, altered receptors can be constructed and expressed having 
improved properties, e.g., enhanced surface expression or G-protein coupling. These 
TIR variants can be incorporated into cell-based and biochemical assa}^. 

It is envisioned that the present discoveries relating to human TlRs will extend 
5 to other species, e.g., rodents, pigs, monkeys, dogs and cats, and perhaps even non- 
mammals such as fish. In this regard, several fish TIR firagments are identified infira in 
Example 1. Therefore, the subject invention has application in screening for 
compoimds for use in animal feed formulations. 

The invention fiirther includes that utilize different allelic variants of various 

1 0 TlRs and combinations thereof, thereby enabling the identification of compounds that 
elicit specific taste sensation in individuals that express those allelic variants or 
compounds that elicit specific taste sensations in all individuals. Such compounds can 
be used to make foods more generally palatable. 

TIR encoding nucleic acids also provide valuable probes for the identification 

IS of taste cells, as the nucleic acids are specifically expressed in taste cells. For example, 
probes for TIR polypeptides and proteins can be used to identify taste cells present in 
foliate, circumvallate, and fungiform papillae, as well as taste cells present in the 
geschmackstreifen, oral cavity, gastrointestmal epithelium, and epiglottis. In particular, 
methods of detecting TlRs can be used to identify taste cells sensitive to sweet and/or 

20 umami taste stimuli or other taste stimuli representing other taste modalities. For 
example, cells stably or transiently expressing T1R2 and/or T1R3 would be predicted 
fi:om the work herein to be responsive to sweet taste stimuli. Similarly, cells expressing 
TlRl and/or T1R3 would be predicted to be responsive to umami taste stimuli. The 
nucleic acids encoding the TIR proteins and polypeptides of the invention can be 

25 isolated firom a variety of sources, genetically engineered, amplified, synthesized, 

and/or expressed recombinantly according to the methods disclosed in WO 00/035374, 
which is herein incorporated by reference in its entirety. A listing of TlRs that may be 
e>q)ressed according to the invention are provided in the Examples. However, it should 
be emphasized that the invention embraces the expression and use of other specific 

30 TlRs or firagments, variants, or chimeras congtmcted based on such TIR sequences, 
and particularly TlRs of other species. 

As disclosed, an important aspect of the invention is the plurality of methods of 
screening for modulators, e.g. , activators, inhibitors, stimulators, enhancers, agonists, 
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and antagonists, of these taste-cell-specific GPCRs« Such modulators of taste 
transduction are useful for the modulation of taste signaling pathways. These methods 
of screening can be used to identify high affinity agonists and antagonists of taste cell 
activity. These modulatory compounds can then be used in the food industry to 
5 customize taste, e.g. , to modulate the sweet and/or umami tastes of foods. 

This invention rectifies the previous lack of understanding relating to sweet and 
umami taste as it identifies specific TlRs and TIR receptor combinations that mediate 
sweet and imiami taste sensation. Therefore, in general, this application relates to the 
inventors' discoveries relating to the TIR class of taste-specific G-protein-coupled 

1 0 receptors and their specific function ui taste perception and the relationship of these 
discoveries to a better understanding of the molecular basis of taste. 

The molecular basis of sweet taste and umami taste - the savor of monosodium 
glutamate - is enigmatic. Recently, a three-member class of taste-specific G-protein- 
coupled receptors, termed TlRs, was identified. Overlapping TIR expression patterns 

1 S and the demonstration that the structurally related GAB Ab receptor is heterodimeric 
suggest that the TlRs fimction as heterodimeric taste receptors. In the examples infia, 
the present inventors describe the functional co-expression of human TlRl, T1R2, and 
T1R3 in heterologous cells; cells co-expressing TlRl and T1R3 are activated by 
imiami taste stimuU; cells co-expressing T1R2 and T1R3 are activated by sweet taste 

20 StimuU. T1R1/T1R3 and T1R2/T1R3 activity correlated witii psychophysical detection 
thresholds. In addition, the 5 '-ribonucleotide IMP was found to enhance the 
T1R1/T1R3 response to glutamate, a synergism characteristic of imiami taste. These 
findings demonstrate that specific TlRs and particularly different combinations of the 
TlRs function as sweet and umami taste receptors. 

25 Human perception of bitter, sweet, and umami is thought to be mediated by G- 

protein-coi5>led recq>tors (Lindemann, B., Physiol. Res. 76:718-66 (1996)). Recentiy, 
evaluation of the human genome revealed tibie T2R class of bitter taste receptors (Adler 
et aL, Cell 100:613-702 (2000); Chandrasgekar et al.. Cell 100:703-11 (2000); 
Matsunami et aL, Nature 404: 601-604 (2000)) but the receptors for sweet and umami 

30 taste have not been identified. Recentiy, another class of candidate taste receptors, tiie 
TlRs, was identified. The TlRs were first identified by large-scale sequencing of a 
subtracted cDNA library derived fiom rat taste tissue, which identified TlRl, and 
subsequently by TlRl-based degenerate PGR, which led to the identification of T1R2 
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(Hoon et al., Cell 96:541-55 1 (1999)). Recently, the present inventors and others 
identified a third and possibly final member of the TIR family, T1R3, in the human 
genome databank (Kitagawa et al., Biochem Biophys. Res Commun. 283(1): 236-42 
(2001); Max et al., Nat. Geiiet 28(1): 58-63 (2001); Sainz et al., J, Neurochem. 77(3): 
5 896-903 (2001); Montmayeur et al., Nat Neurosd. 4, 492-8. (2001)). Tellingly, mouse 
T1R3 maps to a genomic interval containing Sac, a locus that influmces sweet taste in 
the mouse (Fuller et al., 1 Hered. 65:33-6 (1974); Li et al, Mamm. Genome 12:13-16 
(2001)). Therefore, T1R3 was predicted to function as a sweet taste receptor. Recent 
high-resolution genetic mapping studies have strengthened the connection between 

10 mouse T1R3 and Sac (Fuller T.C., J. Hered. 65(1): 33-36 (1974); Li et al., Mammal. 
Genawe 12(1): 13-16 (2001)). 

Interestingly, all C-£amily receptors that have been functionally expressed thus 
fer - metabotropic glutamate receptors, the GABAb receptor, the calcium-sensing 
receptor (Conigrave, A. D„ Quinn, S. J. & Brown, E. M., Proc Natl Acad Sci USA 97, 

15 4814-9. (2000)), and a fish olfactory receptor (Speca, D. J. et al.. Neuron 23, 487-98. 
(1999)) - have been shown to be activated by amino acids. This conoonon feature raises 
the possibility that the TlRs recognize amino acids, and that the TlRs may be involved 
in the detection of glutamate in addition to sweet-tasting amino acids. Alternatively, a 
transcriptional variant of the mGluR4 metabotropic glutamate receptor has been 

20 proposed to be the umami taste receptor because of its selective expression in rat taste 
tissue, and the dmilarity of the rec^tor-activation threshold to the glutamate 
psychophysical detection threshold (Chaudhari et al., Nat. Neurosct 3:113-119 (2000)). 
This hypothesis is difficult to reconcile with the exceedingly low expression level of 
the mGluR4 variant in taste tissue, and the more or less unaltered glutamate taste of 

25 mGluR4 knockout mice (Chaudhari and Roper, Ann. NY. Acad. Sci. 855:398-406 

(1998)). Furthermore, the taste variant is structurally implausible, lacking not only the 
majority of the residues that form the glutamate-binding pocket of the wild-type 
receptor, but also approximately half of the globular N-terminal glutamate-binding 
domain (Kunishima et al., Nature 407:971-7 (2000)). 

30 Comparative analysis of TIR expression patterns in rodents has demonstrated 

that T1R2 and possibly TlRl are each coexpressed with T1R3 (Hoon et al.. Cell 
96:541-5 1 (1999); Kitagawa et al., Biochem Biophy. Res. Commun. 283:236-242 
(2001); Max et al., Nat Genet 28:58-63 (2001); Montmayeur et al., Nat Neurosci 
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4:492-8 (2001); Sainz et al., J. Neurochem 77:896-903 (2001)). Furthennore, 
dimerization is emergmg as a common theme of C-femily receptors: the metabotropic 
glutamate and calcium-sensing receptor are homodim^ (Romomano et al., J. Biol 
Chem. 271:28612-6 (1996); Okamoto et al., J.Biol Chem. 273: 13089-96 (1998); Han 
5 et al., J: Biol Chem. 274: 100008-13 (1999); Bai et al., J. Biol Chem. 273:23605-10 
(1998)), and the structurally related GABAb receptor is heterodimeric (Jones et al., 
Nature 396:674-9 (1998); Kaupmann et al.. Nature 396:683-687 (1998); White et al.. 
Nature 396: 679-682 (1998); Kuner et al. Science l%Z:lA-n (1999)). The present 
inventors have demonstrated by functional coexpression of TlRs in heterologous cells 
10 that human T1R2 functions in combination with human T1R3 as a sweet taste receptor 
and that human TlRl functions in combination with human T1R3 as an umami taste 
receptor. 

The discoveries discussed herein are especially significant, as previously the 
development of improved artificial sweeteners has been hampered by the lack of assays 

15 for sweet taste. Indeed, the five commonly used commercial artificial sweetraers, all of 
which activate hTlR2/hTlR3, were discovered serendipitously. Similarly, other than 
sensory testing, a laborious process, there is no assay for identifying compounds that 
modulate umami taste. These problems are now alleviated because, as established by 
experimental results discussed infra, the human sweet and umann receptors have been 

20 identified, and assays for tiiese receptors have been developed, particularly assays fliat 
use cells that stably express a functional TIR taste receptor, i.e. the sweet or umami 
taste receptor. 

Based thereon the invention provides assays for detecting and characterizing 
taste-modulating compounds, wherein TIR family members act, as they do in the taste 
25 bud, as reporter molecules for the effect on sweet and umami taste of taste-modulating 
compounds. Particularly provided and within the scope of the invention are assays for 
identifying compounds that modulate, mimic, enhance and/or block mdividually, sweet 
and umami tastes. Methods for assaying the activity of GPCRs, and especially 
compomids that affect GPCR activity are well known and are applicable to the TIR 

30 family member ofthe present invention and functional combmations thereof. Suitable 
assays have been identified supra. 
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The invention also provides compounds that bind TlRl, T1R2, T1R3, 
T1R2/T1R3 or T1R1/T1R3, or any fragment, portion, or subunit thereof, as disclosed 
throughout. 

In particular, the subject GPCRs can be used in assays to, e.g., measure changes 
5 in ligand bmding, ion concentration, membrane potential, current flow, ion flux, 
transcription, receptor-ligand interactions, second messenger concentrations, in vitro 
md in vivo. In another embodunent, TIR family members may be recombinantly 
expressed in cells, and the modulation of taste transduction via GPCR activity may be 
assayed by measuring changes in Ca^"^ levels and other intracellular messages such as 

10 cAMP,cGMP,orIP3. 

In certain assays, a domain of a TIR polypeptide, e.g., an extracellular, 
transmembrane, or intracellular domain, is fused to a heterologous polypeptide, thereby 
forming a chimeric polypeptide, e.g., sl chimeric protein with GPCR activity. 
Particularly contemplated is the use of fragments of TlRl, T1R2 or T1R3 containing 

15 the N-tenninalligand-binding domain. Suchproteins are useful, e.g., in assays to 
identify ligands, agonists, antagonists, or other modulators of TIR receptors. For 
example, a TIR polypeptide can be expressed in a eukaiyotic cell as a chimeric 
receptor with a heterologous, chaperone sequence that facilitates plasma membrane 
trafiBcking, or maturation and targeting through the secretory pathway. The optional 

20 heterologous sequence may be a PDZ domain-interacting peptide, such as a C-tenninal 
PDZDP fragment (SEQ ID NO 1). PDZIP is an ER export signal, which, according to 
the present invention, has been shown to faciUtate surface expression of heterologous 
proteins such as the TIR receptors described herein. More particularly, in one aspect of 
the invention, PDZIP can be used to promote proper targeting of probleinatic 

25 membrane proteins such as olfactory receptors, T2R taste receptors, and the TIR taste 
receptors described herein. 

Examples of such chimeric receptors include trans-species receptors. Any 
combination of receptor subunits from various species can be used together to form a 
chimeric receptor, which can then be used to identify tastants, for example. Therefore, 

30 contemplated herein is a chimeric T1R2/T1R3 receptor comprising a human T1R2 
subunit and a rat T1R3 subunit Also contenq>lated is a chimeric T1R2/T1R3 recq)tor 
comprising, a rat T1R2 subunit and a human T1R3 subunit. Also contemplated is a 
chimeric T1R2 receptor subunit comprising, a human extracellular domain, a rat 
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transmembrane domain and a rat intracellular domain (SEQ ID NOS: 16 and 17, for 
example). Also contemplated is chimeric T1R3 receptor subunit comprising, a rat 
extracellular domain, a human transmembrane domain and a human intracellular 
domain (SEQ ID NOS: 18 and 19, for example.) 

5 Such chimeric TIR receptors can be expressed in any eukaryotic cell, such as 

HEK-293 cells. Preferably, the cells contain a G protein, preferably a promiscuous G 
protein such as Gais or Gai6 or another type of promiscuous G protein capable of 
linking a wide range of GPCRs to an intracellular signaling pathway or to a signaling 
protein such as phospholipase C. Activation of such chimeric receptors in such cells 

1 0 can be detected using any standard method, such as by detecting changes in 

intracellular calcium by detecting FURA-2 dependent fluorescence in the cell. If 
preferred host cells do not express an ^^propriate G protein, they may be transfected 
with a gene encoding a prontiiscuous G protein such as those described in U.S. 
Application Serial No. 60/243,770, U.S. >^plication Serial No. 09/984,297, filed 

15 October 29, 2001, and U.S. Application Serial No. 09/989,497 filed November 21, 
2001 which are herein incorporated by reference in its entirety. 

Additional methods of assaying for modulators of taste transduction include in 
vitro Kgand-binding assays using: TIR polypeptides, portions thereof, i.e., the 
extracellular domain, transmembrane region, or combinations thereof, or chimeric 

20 proteins comprising one or more domains of a TIR family member; oocyte or tissue 
cxilture cells expressing TIR polypeptides, firagments, or fusion proteins; 
phosphorylation and dephosphorylation of T IR family members; G protein binding to 
GPCRs; ligand-binding assays; voltage, membrane potential and conductance changes; 
ion flux assays; changes in intracellular second messengers such as cGMP, cAMP and 

25 inositol triphosphate (IP3); and changes in intracellular calcium levels. 

Further, the invention provides methods of detecting TIR nucleic acid and 
protein expression, allowing investigation of taste transduction regulation and specific 
identification of taste receptor cells. TIR family members also provide useful nucleic 
acid probes for paternity and forensic investigations. TIR genes are also useful as 

30 nucleic acid probes for identifying taste receptor cells, such as foliate, fungiform, 

circumvallate, geschmackstreifen, and epiglottis taste recq)tor cells. TIR recqptors can 
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also be used to generate monoclonal and polyclonal antibodies useful for identifying 
taste receptor cells. 

Functionally, the TIR polypeptides comprise a family of related seven 
transmembrane G protein-coupled receptors, which are believed to be involved in taste 
5 transduction and may interact with a G protein to mediate taste signal transduction {see^ 
e,g., Fong, Cell Signal, 8:217 (1996); Baldwin, Curr Opin. Cell Biol, 6:180 (1994)). 
Structurally, the nucleotide sequences of TIR family members encode related 
polypeptides comprising an extracellular domain, seven transmembrane domains, and a 
cytoplasmic domain. Related TIR family genes from other species share at least about 

10 50%, and optionally 60%, 70%, 80%, or 90%, nucleotide sequence identity over a 
region of at least about 50 nucleotides in lengfli, optionally 100, 200, 500, or more 
nucleotides in length to the TIR nucleic acid sequences disclosed herein in the 
Examples, or conservatively modified variants thereof or encode polypeptides sharing 
at least about 35 to 50%, and optionally 60%, 70%, 80%, or 90%, amino acid sequence 

15 identity over an amino acid region at least about 25 anuno acids in length, optionally 50 
to 100 amino acids in length to a TIR polypeptide sequence disclosed infra in the 
Examples conservatively modified variants thereof 

Several consensus amino acid sequences or domains have also been identified 
that are characteristic of TIR family members. For example, TIR family members 

20 typically comprise a sequence having at least about 50%, optionally 55%, 60%, 65%, 
70%, 75%, 80%, 85%, 90%, 95-99%, or higher, identity to TIR consensus sequences 1 
and 2 (SEQ ID NOs. 2 and 3, respectively). These conserved domains thus can be used 
to identify members of the TIR family, by identity, specific hybridization or 
amplification, or specific binding by antibodies raised against a domain. TIR 

25 consensus sequences include by way of example the following sequences: 
TIR Family Consensus Sequence, 1 : (SEQ ID NO: 2) 
(TR)C(FL)(RQP)R(RT)(SPV)(VERKT)FL(AE)(WL)(RHG)E 
TIR Family Consensus Sequence 2: (SEQ ID NO: 3) 
(LQ)P(EGT)(NRQYN(RE)A(RK)(CGF)(VLI)T(FL)(AS)(N^^ 

30 These consensus sequences are inclusive of those found in the TIR 

polypeptides described herein, but TIR family members from other organisms may be 
expected to comprise consensus sequences having about 75% identity or more to the 
inclusive consensus sequences described specifically herein. 
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Specific regions of the TIR nucleotide and amino acid sequences may be used 
to identify polymorphic variants, interspecies homologs, and alleles of TIR family 
members. This identification can be made in vitro, e.g., under stringent hybridization 
conditions or PGR (eg., using primers encoding the TIR consensus sequences 
5 identified above), or by using the sequence information in a computer system for 
comparison with other nucleotide sequences. Different alleles of TIR genes within a 
single species population will also be useful in determining whether differences in 
allelic sequences control difierences in taste perception between members of the 
population. Classical PCR-type amplification and cloning techniques are useful for 
10 isolating new TlRs, for exaniple, where degenerate primers are sufiELcient for detecting 
related genes across species. 

Typically, identification of polymorphic variants and alleles of TIR family 
members can be made by comparing an amino acid sequence of about 25 amino acids 
or more, e,g., 50-100 amino acids. Amino acid identity of approximately at least 35 to 
15 50%, and optionally 60%, 70%, 75%, 80%, 85%, 90%, 95-99%, or above typically 
demonstrates that a protein is a polymorphic variant, interspecies homolog, or allele of 
a TIR family member. Sequence comparison can be performed using any of the 
sequence comparison algorithms discussed below. Antibodies that bind specifically to 
TIR polypeptides or a conserved region thereof can also be used to identify alleles, 
20 interspecies homologs, and polymorphic variants. 

Polymorphic variants, interspecies homologs, and alleles of TIR genes can be 
confirmed by examining taste-cell-specific expression of the putative TIR gene or 
protein. Typically, TIR polypeptides having an amino acid sequence disclosed herein 
can be used as a positive control in con^arison to the putative TIR polypeptide to 
25 demonstrate the identification of a polymoiphic variant or allele of the TIR family 
member. The polymorphic variants, alleles, and mterspecies homologs are expected to 
retain the seven transmembrane structure of a G protein-coupled receptor. For fiirflier 
detail, 5^6 WO 00/06592, which discloses related TIR family members, GPCR-B3s, 
the contents of which are herein incorporated by reference in a manner consistent with 
30 this disclosure. GPCR-B3 receptors are referred to herein as rTlRl and mTlRl . 
Additionally, see WO 00/06593, which also discloses related TIR family members, 
GPCR-B4S, the contents of which are herein incorporated by reference in a manner 
consistent with this disclosure. GPCR-B4 receptors are referred to herem as rTlR2 and 
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mTlR2. As discussed previously, fhe invention also includes structure-based assays 
flaat utilize the x-ray crystalline structure of a TIR or TIR combination, e.g., 
hTlR2yhTlR3 or hTlRl/hTlR3, to identify molecules that modulate TIR receptor 
activity, and thereby modulate sweet and/or umami taste. 
5 The present invention also provides assays, preferably high throughput assays, 

to identify molecules that enhance, mimic, block and/or modulate TIR receptors. In 
some assays, a particular domain of a TI R family member is used in combination with 
a particular domain of another TIR family member, e,g. , an extracellular, 
transmembrane, or intracellular domain or region. In other embodiments, an 

10 extracellular domain, transmembrane region or combination thereof may be bound to a 
solid substrate, and used, e.g. , to isolate ligands, agonists, antagonists, or any other 
molecules that can bind to and/or modulate the activity of a TIR polypeptide. 

Various conservative mutations and substitutions are envisioned to be within the 
scope of the invention. For instance, it is within the level of skill in the art to perform 

15 amino acid substitutions using known protocols of recombinant gene technology 

including PGR, gene cloning, site-directed mutagenesis of cDNA, transfection of host 
cells, and in-vitro transcription. The variants could then be screened for activity. 

Definitions 

20 As used herein, the following terms have the meanings ascribed to them imless 

specified otherwise. 

*Taste cells" include neuroepithelial cells that are organized into groups to 
form taste buds of the tongue, e.g., foliate, fungiform, and circumvallate cells (see, e.g., 
Roper et al, Ann. Rev. Neurosci. 12:329-353 (1989)). Taste cells are also found in the 

25 palate and other tissues, such as the esophagus and the stomach. 

"TIR'* refers to one or more members of a family of G protein-coupled 
receptors that are expressed in taste cells such as foliate, fungiform, and circumvallate 
cells, as well as cells of the palate, and esophagus (see, e.g, Hoon etal. Cell, 
96:541-551 (1999), herein incorporated by reference in its entirety). Members of this 

30 family are also referred to as GPCR-B3 and TRl in WO 00/06592 as well as GPCR-B4 
and TR2 in WO 00/06593. GPCR-B3 is also herein referred to as rTlRl, and GPCR- 
B4 is referred to as rTlR2. Taste receptor cells can also be identified on the basis of 
morphology (see, e.g.. Roper, supra), or by the expression of proteins specifically 
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expressed in taste cells. TIR family members may have the ability to act as receptors 
for sweet taste transduction, or to distinguish between various other taste modalities. 
Representative TIR sequences, including hTlRl, hTlR2 and hTlR3 are identified 
infira in the examples. 

5 'TIR" nucleic acids encode a family of GPCRs with seven transmembrane 

regions that have "G protein-coupled receptor activity," eg. , they may bind to G 
proteins in response to extracellular stimuli and promote production of second 
messengers such as P3, cAMP, cGMP, and Ca^"^ via stimulation of enzymes such as 
phospholipase C and adenylate cyclase (for a description of the structure and function 
10 of GPCRs, see, e.g. , Fong, supra, and Baldwin, supra). A single taste cell may contain 
many distinct TIR polypeptides. 

The term *T1R" family therefore refers to polymorphic variants, alleles, 

mutants, and interspecies homologs that: (1) have at least about 35 to 50% amino acid 

I 

sequence identity, optionally about 60, 75, 80, 85, 90, 95, 96, 97, 98, or 99% amino 
15 acid sequence identity to a TIR polypeptide, preferably those identified in Example 1, 
over a window of about 25 amino acids, optionally 50-100 amino acids; (2) specifically 
bind to antibodies raiised against an immunogen conq)rising an amino acid sequence 
preferably selected fi:om the group consisting of the TIR polypeptide sequence 
disclosed in Example 1 and conservatively modified variants thereof; (3) are encoded 
20 by a nucleic acid molecule which specifically hybridize (with a size of at least about 
100, optionally at least about 500-1000 nucleotides) under stringent hybridization 
conditions to a sequence selected from the group consisting of the TIR nucleic acid 
sequences contained in Example 1, and conservatively modified variants thereof; or (4) 
comprise a sequence at least about 35 to 50% identical to an amino acid sequence 
25 selected firom the group consisting of the TIR amino acid sequence identified in 
Example 1. 

Topologically, the TIRs disclosed herein have an **N-teraiinal domain" also 
called "extracellular domain" comprising a "venus flytrap domain" and a "cysteine rich 
domain;" "transmembrane domains" coniprising seven transmembrane regions, and 
30 corresponding cytoplasmic, and extracellular loops; and a "C-teiminal domain" {see^ 
e,g., Hoon et aL, Cell, 96:541-551 (1999); Buck & Axel, Cell, 65:175-187 (1991)). 
These domains have been structurally identified using methods known to those of skill 
in the art, such as sequence analysis programs that identify hydrophobic and 
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hydrophiUc domains (Stryer, Biochemistry, (3rd ed. 1988). Such domains are useful for 
making chimeric proteins and for in yitro assays of the invention, e.g., Ugand binding 
assays. Thespecificbindingof a compound to these stnicturaUy defined doniains 
provides provides structural definition for the compound. 

5 **Extracellular domains" therefore refers to the domains of TIR polypeptides 

that protrude ftom the cellular membrane and are exposed to the extracellular face of 
the cell. Such domains generally include the terminal domain" that is exposed to 
the extracellular face of the cell, and optionally can include portions of the extracellular 
loops of the transmembrane domain that are exposed to the extracellular face of the 

10 cell, the loops between transmembrane regions 2 and 3, between transmembrane 
regions 4 and 5, and between transmembrane regions 6 and 7. 

The ''N-terminal domain" region starts at the N-terminus and extends to a 
region close to the start of the first transmembrane domain. More particularly, in one 
embodiment of the invention, this domain starts at the N-terminus and ends 

15 approximately at the conserved glutamic acid at ammo acid position 563 plus or minus 
approximately 20 amino acids. These extracellular domains are useful for in vitro 
ligand-binding assays, both soluble and solid phase. In addition, transmembrane 
regions, described below, can also bind ligand either in combination with the 
extracellular domain, and are therefore also useful for in vitro ligand-binding assays. 

20 "Cysteine-rich domain" refers to the domain of the polypeptides. This 

conserved sequence contains several highly-conserved Cys residues that form 
disulphide bridges, and lies outside the ceU membrane.. This region corresponds to tiie 
domain of the TIR family members and is found in all three subunits, T1R1-T1R3. The 
cysteine rich sequence is found m amino acids 510-566 of TlRl, 508-565 of T1R2, and 

25 512-568 or T1R3. 

"Transmembrane domain," which comprises the seven "transmembrane 
regions," refers to the domain of TIR polypeptides that lies within the plasma 
membrane, and may also include the corresponding cytoplasmic (intracellular) and 
extracellular loops. In one embodiment, this region corresponds to the domam of TIR 

30 family members which starts approximately at the conserved glutamic acid residue at 
ammo acid position 563 plus or minus 20 amino acids and ends ^proximately at the 
conserved tyrosine amino acid residue at position 812 plus or minus approximately 10 
amino acids, the seven transmembrane regions and extracellular and cytoplasmic loops 
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can be identified using standard methods, as described in Kyte & Doolitfle, /. Moh 
BioU 157:105-32 (1982)), or in Stryer, svpra. 

"Cytoplasmic domains" refers to the domains of TIR polypeptides that face the 
inside of the cell, the "C-tenninal domain" and the intracellular loops of the 
5 transmembrane domain, the intracellular loop between transmembrane regions 1 
and 2, the intracellular loop between transmembrane regions 3 and 4, and the 
intracellular loop between transmembrane regions 5 and 6. 

"C-terminal domain" refers to the region that spans the end of the last 
transmembrane domain and the C-terminus of the protein, and which is normally 

10 located within the cytoplasm. In one embodiment, this region starts at the conserved 
tyrosine amino acid residue at position 812 plus or minus approximately 10 amino 
acids and continues to the C-terminus of the polypeptide. 

The term "ligand-binding region" or *Tigand-binding domain" refers to 
sequences derived from a taste recqptor, particularly a taste recq>torttmt5ubstantiaUy . . 

15 incorporates at least the extracellular domain of the receptor. In one embodiment, the 
extracellular domain of the ligand-binding region may include the N-tenninal domain 
and, optionally, portions of the transmembrane domain, such as &e extracellular loops 
of the transmembrane domain. The ligand-binding region may be capable of binding a 
ligand, and more particularly, a compound fliat enhances, mimics, blocks, and/or 

20 modulates taste, e.g., sweet or umami taste. 

The phrase **heteromultimer'* or *1ieteromultimeric complex" in the context of 
the TIR recq)tors or polypeptides of the invention refers to a functional association of 
at least one TIR receptor and another receptor, typically another TIR receptor 
polypeptide (or, altematively another non-TlR receptor polypeptide). For clarity, the 

25 functional co-dependence of the TlRs is described in this application as reflecting their 
possible function as heterodimeric taste receptor complexes. However, as discussed 
previously, functional co-dependence may altematively reflect an indirect interaction. 
For example, T1R3 may function solely to facilitate surfece expression of TlRl and 
T1R2, which may act independently as taste receptors. Alternatively, a functional taste 

30 receptor may be comprised solely of T1R3, which is differentially processed under the . 
control of TlRl or T1R2, analogous to RAMP-dependent processing of the calcium- 
related receptor. 
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The phrase "functional effects" in the context of assays for testing compounds 
that modulate TIR family member mediated taste transduction includes the 
determination of any parameter that is indirectly or directly under the influence of the 
receptor, e.g., fimctional, physical and chemical effects. It includes Ugand binding, 
5 changes in ion flux, membrane potential, current flow, transcription, G protein binding, 
GPCR phosphorylation or dephosphorylation, conformation change-based assays, 
signal transduction, receptor-ligand interactions, second messenger concentrations (eg. , 
cAMP, cGMP, IP3, or intracellular Ca^^, i/i vitro, in vivo, and ex vivo and also includes 
other physiologic effects such increases or decreases of neurotransmitter or hormone 
10 release. 

By ^'detenndning the functional effecf ' in the context of assays is meant assays 
for a compound that increases or decreases a parameter that is indirectly or directly 
under the influence of a TIR family member, e.g., functional, physical and chemical 
effects. Such fimctional effects can be measured by any means known to those skilled • 

15 in the art, e.g. , changes in spectroscopic characteristics (eg, fluorescence, absorbency, 
refractive index), hydrodynamic (eg, shape), chromatographic, or solubility properties, 
patch clamping, voltage-sensitive dyes, whole cell currents, radioisotope efflux, 
inducible markers, oocyte TIR gene expression; tissue culture cell TIR expression; 
transcriptional activation of TIR genes; ligand-binding assays; voltage, membrane 

20 potential and conductance changes; ion flux assays; changes in intracellular second 
messengers such as cAMP, cGMP, and inositol triphosphate (IPS); changes in 
intracellular calcium levels; neurotransmitter release, conformational assays and the 
like. 

A "flavor or tastant" herein refers to a compound or biologically acceptable salt 
25 thereof that induces, in a subject, the perception of smell and/or taste, which include 
sweet, sour, salty, bitter and umami, and others. The subject can be human, animals, 
and/or a biological assay, such as tihe ones described and cited in this application. 

A "flavor or taste modifier** herein refers to a compound or biologically 
acceptable salt thereof that modulates, including enhancing or potentiating, inhibiting, 
30 and inducing, the smell and/or tastes of a natural or synthetic tastants in a subject 
A "flavor or taste enhancer^* herein refers to a compoimd or biologically 
acceptable salt thereof that enhances the tastes or smell of a natural or synthetic 
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I 

tastants, e.g., monosodium glutamate (MSG) for umami taste and fructose for sweet 
taste. 

"Umami tastanf * or '*umami compound" herein refers to a compound or 
biologically acceptable salt thereof that elicits a detectable umami taste m a subject, 
5 e.g-.,MSG. 

"Sweet tastanf or "sweet compound" herein refers to a compound or 
biologically acceptable salt thereof that elicits a detectable sweet taste in a subject, 
fructose. 

An "umami taste modifier"' herein refers to a compound or biologically 
10 acceptable salt thereof that modulates, including enhancing or potentiating, inhibiting, 
and inducing, the umami taste of a natural or synthetic umami tastants, e.g., 
monosodium glutamate (MSG) in a subject 

A "sweet taste modifier'' herem refers to a compound or biologically acceptable 
salt thereof that modulates, including enhancing or potentiating, inhibiting, and 
15 mducing, the sweet taste of a natural or synthetic sweet tastants, e.g., fructose, in a 
subject. 

A **taste enhancing amount" herein refers to an amount of a compound that is 
sufficient to enhance the taste of a natural or synthetic tastants, e.g., monosodium 
glutamate (MSG) for umami taste or fiuctose for sweet taste. 

20 "Wet Soup Category" means wet/liquid soups regardless of concentration or 

container, including frozen Soups. For the purpose of this definition soup(s) means a 
food prepared from meat, poultry, fish, vegetables, grains, fiiiit and other ingredients, 
cooked in a liquid which may include visible pieces of some or all of these ingredients. 
It may be clear (as a broth) or thick (as a chowder), smooth, pureed or chunky, ready- 

25 to-serve, semi-condensed or condensed and may be served hot or cold, as a first course 
or as the mam course of a meal or as a between meal snack (sipped like a beverage). 
Soup may be used as an ingredient for preparing other meal components and may range 
. from broths (consomm6) to sauces (cream or cheese-based soups). 

*T)ehydrated and Culinary Food Category*' means: (i) Cooking aid products 

30 such as: powders, granules, pastes, concentrated liquid products, including concentrated 
bouillon, bouillon and bouillon like products in pressed cubes, tablets or powd^ or 
granulated form, which are sold separately as a finished product or as an ingredient 
within a product, sauces and recipe mixes (regardless of technology); (ii) Meal 
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solutions products such as: dehydrated and freeze dried soups, including dehydrated 
soup mixes, dehydrated instant soups, dehydrated ready-to-cook soups, dehydrated or 
ambient preparations of ready-made dishes, meals and single serve entries including 
pasta, potato and rice dishes; and (iii) Meal embellishment products such as: 
5 condiments, marinades, salad dressings, salad toppings, dips, breading, batter mixes, 
shelf stable spreads, barbecue sauces, liquid recipe mixes, concentrates, sauces or sauce 
mixes, including recipe mixes for salad, sold as a finished product or as an ingredient 
within a product, whether dehydrated, liquid or frozen. 

'"Beverage Category" means beverages, beverage mixes and concentrates, 
10 including but not limited to, alcoholic and non-alcoholic ready to drink and dry 

powdered Other examples of foods and beverages wherein compoimds according 
to the invention may be incorporated included by way of example carbonated and non- 
carbonated beverages, e.g. , sodas, juices, alcoholic and non-alcoholic beverages, 
confectionary products, e.g., cakes, cookies, pies, candies, chewing gums, gelatins, ice 
IS creams, soifoets, puddings, jams, jellies, salad dressings, and other condiments, cereal, 
and other breakfast foods, canned fruits and fruit sauces and the like. 

Additionally, the subject compounds can be used in flavor preparations to be 
added to foods and beverages. In preferred instances the composition will comprise 
another flavor or taste modifier such as a sweet tastant. 
20 In some instances biologically acceptable salts of the subject compounds may 

be used. Examples of such salts include alkali and earth metal salts, organic salts, and 
the like. Specific examples include potassium, sodium, calcium and magnesium salts, 
hydrochloric or sulfuric acid salts, ethanolamine salts, and the like. The salt will be 
selected such that it is biologically safe for ingestion and does adversely affect the 
25 sweet taste modulatory properties of the compound. 

As used herein, the tenn "medicinal product" includes both soUds and liquids 
which are ingestible non-toxic materials which have medicinal value such as cough 
syrups, cough drops, aspirin and chewable medicinal tablets. An oral hygiene product 
includes solids and liquids such as tootlq)ast6 or mouthwash. 
30 A "comestibly or medicinally acceptable carrier or exdpient" is a medium that 

is used to prepare a desired dosage form of the inventive compound A comestibly or 
medicmally acceptable carrier includes solvents, diluents, or other liquid vehicle; 
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dispersion or suspension aids; surface active agents; isotonic agents; thickening or 
emulsifying agents, preservatives; solid binders; lubricants and the like. 

•Inhibitors," "activators," "enhancers" and **modulators" of TIR genes or 
proteins are used to refer to inhibitory, activating, enhancing or modulating molecules 
5 identified using in vitro and in vivo assays for taste transduction, e.g., ligands, agonists, 
antagonists, and their homologs and mimetics. 

Inhibitors are compounds that, e.g, , bind to, partially or totally block 
stimulation, decrease, prevent, delay activation, inactivate, desensitize, or down 
regulate taste transduction, e.g^., antagonists. Activators and enhancers are compounds 
10 that, e.g.y bind to, enhance, stimulate, increase, open, activate, facilitate, enhance 
activation, sensitize, or up regulate taste transduction, e.g,, agonists. Modulators 
include compounds that, e.g, alter the interaction of a receptor with: extracellular 
proteins that bind activators or inhibitor (eg., ebnerin and other members of the 
hydrophobic carrier family); G proteins; kinases (e.g., homologs of rhodopsin kinase 
1 5 and beta adrenergic receptor kinases that are involved in deactivation and 

desensitization of a receptor); and arrestins, which also deactivate and desensitize 
receptors. Modulators can include genetically modified versions of TIR family 
members, e.g., with altered activity, as well as naturally occurring and synthetic 
ligands, antagonists, agonists, small chemical molecules and the like. Such assays for 
20 inhibitors and activators include, e.g., expressing TIR family members in cells or cell 
membranes, applying putative modulator compounds, in the presence or absence of 
tastants, e,g, sweet tastants, and then determining the functional effects on taste 
transduction, as described above. Samples or assays comprising TIR family members 
that are treated with a potential enhancer, activator, inhibitor, or modulator are 
25 compared to control samples without the inhibitor, activator, or modulator to examine 
the extent of modulation. Positive control samples (e.g. a sweet tastant without added 
modulators) are assigned a relative TIR activity value of 100%. 

^'ECso" is defined as the amount of a compound that elicits 50% of the moYma] 
response the compound can elicit, whether as an activator, enhancer, or modulator. A 
30 dose-dependent response curve was determined for a compound^ and the compound 
concentration corresponding to 50% of tiie maximal response was derived firom the 
curve, in one example. 
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*TCso* is defined as the amount of a compound that elicits 50% of the maxinial 
effect the compound can elicit as an inhibitor. 

Regarding sweet tastants and chancers, after a compound is identified, scoies 
of their activities are given as percentage of the maximum fiiictose intensity (%). In 
5 compound dose response, an EC50 can be calculated to reflect the potency of the 
compound as a sweet agonist. In the present invention, an EC50 of lower than about 
100 mM is indicative of compounds that induce T1R2/T1R3 activity as a sweet agonist. 
Preferably, a positive hit for a sweet agonist has an EC50 value of less than about 1 mM; 
more preferably less than about a 10 ^iM. 
10 In sweet enhancement assay experiments, a finctose dose response was run and 

a second fructose dose response was run with a certain amount of candidate compound 
at every fiructose concentrations at the same time. Then, the EC50 ratio can be 
calculated based on tiie following definitions: 

EC50 Ratio = EC50 (fiiictose)/EC5o (fiuctose + [Compound]) 
15 ' wherein "[compound]" refers to the concentration of compound used to elicit 

(or enhance or potentiate) the fiructose dose response. Those concentrations could vary 
&om a pM to an mM, more preferred, firom a low nM to |iM. A potent sweet enhancer 
would have a high EC50 Ratio at a low concentration of the compound used. 

hi tiie present invention, an EC50 ratio of greater than 1 is indicative of a 
20 compound tibat modulates (potentiates) T1R2/T1R3 activity and is an sweet enhancer. 
Preferably, a positive hit will have EC50 ratio values of at least 1.20, preferably ranging 
from at least 1.50 to 100 or even higher. 

By contrast, competing agonists (those sweet tastants that bind mutually 
exclusively) or inhibitors always yield values of EC50 ratio less than 1, such as from 0- 
25 1. 

Regarding umami tastants and enhancers, scores of their activities can be given 
as percentage of the maximum MSG mtensity (%). In compound dose response, an 
EC50 can be calculated to reflect the potency of the contpound as umami agonist In the 
present invention, an EC50 of lower than about 10 mM is indicative of compounds that 
30 mduce T1R1/T1R3 activity and an umami agonist Preferably, a positive hit for an 
umami agonist will have EC50 values of less than about 1 mM; more preferably rangmg 
from about a pM to about a low ^M. 
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In enhancement assay experiments, a MSG dose response was run and a second 
MSG dose response was run with a certain amount of candidate compound at every 
MSG concentrations at the same time. Then, tiie ECso i^tio is calculated based on the 
following definitions: 
5 EC50 Ratio = EC50 (MSG)/EC5o (MSG + [Compound]) 

wherein "[compound]" refers to the concentration of compound used to elicit 
(or enhance or potentiate) the MSG dose response. Those concentrations can vary 
from a pM to an niM, more preferred, from a low nM to ^M. A potent umami 
enhancer has a high EC50 Ratio at a low concentration of the compound used 

.10 In the present invention, an EC50 ratio of greater than 1 is indicative of a 

compound that modulates (potentiates) T1R1/T1R3 activity and in an umami enhancer. 
Preferably, a positive hit has EC50 ratio values of at least 1 .20, preferably ranging from 
at least l.SO to 100 or even higher. 

Negative control samples (e.g. buffer without an added taste stimulus) are 

IS assigned a relative TIR activity value of 0%. Inhibition of a TIR is achieved when a 
mixture of the positive control sample and a modulator result in the TIR activity value 
relative to the positive control is about 80%, optionally 50% or 25-0%. Activation of a 
TIR by a modulator alone is achieved when the TIR activity value relative to the 
positive control sample is 10%, 25%, 50%, 75%, optionally 100%, optionally 150%, 

20 optionally200-500%, or 1000-3000% higher. 

The terms "purified," "substantially purified," and ^Isolated" as used herein 
refer to the state of being free of other, dissimilar compoimds with which the compound 
of the invention is normally associated in its natural state, so that the **purified," 
"substantially purified," and "isolated" subject comprises at least 0.5%, 1%, 5%, 10%, 

25 or 20%, and most preferably at least 50% or 75% of the mass, by weight, of a given 
sample. In one preferred embodiment, these terms refer to the compound of the 
invention comprising at least 95% of the mass, by weigjit, of a given sample. As used 
herem, the terms 'purified," "substantially purified," and "isolated," when referring to a 
nucleic acid or protein, also refers to a state of purification or concentration different 

30 than that which occurs naturally in the mammalian, especially human body. Any 
degree of purification or concentration greater than that which occurs naturally in the 
mammalian, especially human, body, including (1) the purification from other 
associated structures or compounds or (2) the association witii structures or compounds 
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to which it is not normally associated in the mammalian, especially human, body, are 
withm the meaning of "isolated." The nucleic acid or protein or classes of nucleic acids 
or proteins, described herein, may be isolated, or otherwise associated with structures 
or compounds to which they are not normally associated in nature, according to a 
5 variety of methods and processes known to those of skill in the art. 

The term '"nucleic acid" or "nucleic acid sequence" refers to a deoxy- 
ribonucleotide or ribonucleotide oligonucleotide in either single- or double-stranded 
form. The term encompasses nucleic acids, i.e., oligonucleotides, containing known 
analogs of natural nucleotides. The term also encompasses nucleic-acid-like structures 
10 with synthetic backbones {see eg.. Oligonucleotides and Analogues, a Practical 

Approach, ed. F. Eckstein, Oxford Univ. Press (1991); Antisense Strategies, Annals of 
the K Y. Academy of Sciences, Vol. 600, Eds. Baserga et al (NYAS 1992); Milligan J. 
Med. Chem. 36:1923-1937 (1993); Antisense Research and Applications (1993, CRC 
Press), WO 97/0321 1; WO 96/39154; Mata, ToxicoL Appl Pharmacol 144:189-197 ■ 
15 (1997); Strauss-Soukup, Biochemistry 36:8692-8698 (1997); Samstag, Antisense 
Nucleic Acid Drug Dev, 6:153-156 (1996)). 

Unless otherwise indicated, a particular nucleic acid sequence also implicitly 
encompasses conservatively modified variants thereof {e,g, , degenerate codon 
substitutions) and complementary sequences, as well as the sequence explicitly 
20 indicated. Specifically, degenerate codon substitutions may be achieved by generating, 
e,g, sequences in which the third position of one or more selected codons is substituted . 
with mixed-base and/or deoxyinosine residues (Batzer et al , Nucleic Acid Res. , 
19:508 1 (1991); Ohtsuka et al, X Biol Chem., 260:2605-2608 (1985); Rossolini et al, 
Mol Cell Probes, 8:91-98 (1994)). The term nucleic acid is used interchangeably witii 
25 gene, cDNA, mRNA, oligonucleotide, and polynucleotide. 

The terms ^polypeptide," "peptide" and ^'protein" are used interchangeably 
herein to refer to a polymer of amino acid residues. The terms apply to amino acid 
polymers in which one or more amino acid residue is an artificial chemical mimetic of 
a corresponding naturally occurring amino acid, as well as to naturally occurring amino 
30 acid polymers and non-naturally occurring amino acid polymer. 

The term "plasma membrane translocation domain" or simply "translocation 
domain" means a polypeptide domain that, when incorporated into a polypeptide 
coding sequence, can with greater efficiency "chaperone" or ^^translocate" the hybrid ' 
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("fusion") protein to the cell plasma membrane than without the domain. For instance, 
a '^translocation domain" may be derived from the amino terminus of the bovine 
rhodopsin receptor polypeptide, a 7-transmembrane receptor. However, rhodopsin 
from any mammal may be used, as can other translocation facilitating sequences. Thus, 
5 the translocation domain is particularly efficient in translocatmg T-transmembrane 
fusion proteins to the plasma membrane, and a protein (e.g, , a taste receptor 
polypeptide) comprising an amino terminal translocating domain will be transported to 
the plasma membrane more efficiently than without the domain. However, if the N- 
terminal domain of the polypeptide is active in binding, as with the TIR receptors of 
10 the present invention, the use of other translocation domains may be preferred. For 
instance, a PDZ domain-interacting peptide, as described herein, may be used. 

The '^translocation domain," "ligand-binding domain", and chimeric receptora 
compositions described herein also include "analogs," or "conservative variants" and 
'Wmetics" C'peptidomimetics*') with structures and activity that substantially 
1 5 correspond to the exemplary sequences. Thus, the terms "conservative variant or 
"analog" or "mimetic" refer to a polypq)tide which has a modified amino acid 
sequence, such that the change(s) do not substantially alter the polypeptide's (the 
conservative variant's) structure and/or activity, as defined herein. These include 
conservatively modified variations of an amino acid sequence, amino acid 
20 substitutions, additions or deletions of those residues that are not critical for protein 
activity, or substitution of amino acids with residues having similar properties (e.g,, 
acidic, basic, positively or negatively charged, polar or non-polar, etc.) such that the 
substitutions of even critical amino acids does not substantially alter stracture and/or 
activity. 

25 More particularly, "conservatively modified variants" applies to both amino 

acid and nucleic acid sequences. With respect to particular nucleic acid sequences, 
conservatively modified variants refers to those nucleic acids which encode identical or 
essentially identical amino acid sequences, or where the nucleic acid does not encode 
an amino acid sequence, to essentially identical sequences. Because of the degeneracy 

30 of the genetic code, a large number of fimctionally identical nucleic acids encode any 
given protein. 

For instance, the codons GCA, GCC, GCG and GCU all encode the amino acid 
alanine. Thus, at every position where an alanine is specified by a codon, flie codon can 
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be altered to any of the corresponding codons described without altering the encoded 
polypeptide. 

Such nucleic acid variations are "silent variations/' which are one species of 
conservatively modified variations. Every nucleic acid sequence herein, which encodes 
5 a polypeptide, also describes every possible silent variation of the nucleic acid. One of 
skill will recognize that each codon m a nucleic acid (except AUG, which is ordinarily 
the only codon for methionine, and TGG, which is ordinarily the only codon for 
tryptophan) can be modified to yield a fimctionally identical molecule. Accordingly, 
each silent variation of a nucleic acid, which encodes a polypeptide, is implicit in each 

1 0 described sequence. 

Conservative substitution tables providing fimctionally similar amino acids are 
well known in the art. For example, one exemplary guideline to select conservative 
substitutions includes (original residue followed by exemplary substitution): ala/gly or > 
ser; arg/lys; asn/gki or his; asp/glu; cys/ser; ghi/asn; gly/asp; gly/ala or pro; his/asn or 

15 ghi; ile/leu or val; leu/ile or val; lys/arg or ghi or glu; met/leu or tyr or ile; phe/met or 

leuortyr, ser/thr;thr/ser;tip/tyr,tyr/trporphe;val/ileorleu. An alternative exemplary . 
guideline uses the following six groups, each containing amino acids that are 
conservative substitutions for one another: 1) Alanine (A), Serine (S), Threonine (T); 2) 
Aspartic acid (D), Glutamic acid (E); 3) Asparagine (N), Glutamine (Q); 4) Arginine 

20 (R), Lysine (I); 5) Isoleucine (I), Leucine (L), Methiomne (M), Valine (V); and 6) 

Phenylalanine (F), Tyrosine (Y), Tryptophan (W); {see also, e.g., Creighton, Proteins, 
W.H. Freeman and Company (1984); Schultz and Schimer, Principles of Protein 
Structure, Springer-Vrlag (1979)). One of skill in the art will appreciate that the above- 
identified substitutions are not the only possible conservative substitutions. For 

25 example, for some purposes, one may regard all charged amino acids as conservative 
substitutions for each other whether they are positive or negative. Li addition, 
individual substitutions, deletions or additions that alter, add or delete a single amino 
acid or a small percentage of amino acids in an encoded sequence can also be 
considered "conservatively modified variations," 

30 The temis "mimetic" £md **peptidomimetic" refer to a synthetic chemical 

compound that has substantially the same structural and/or fimctional characteristics of 
the polypeptides, e,g. , translocation domains, ligand-bindmg domains, or chimeric 
receptors of tiie invention. The mimetic can be either entirely composed of synthetic. 
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non-natural analogs of amino acids, or may be a chimeric molecule of partly natural 
pq)tide amino acids and partly non-natural analogs of amino acids. The mimetic can 
also incorporate any amount of natural amino acid conservative substitutions as long as 
such substitutions also do not substantially alter the mimetic's structure and/or activity, 

5 As with polypeptides of tiie invention which are conservative variants, routme 

experimentation will detennine whether a mimetic is within the scope of the invention, 
le, , that its structure and/or function is not substantially altered. Polypeptide mimetic 
compositions can contain any combination of non-natural structural components, which 
are typically from three structural groups: a) residue linkage groups other than the 

10 natural amide bond C'peptide bond") linkages; b) non-natural residues in place of 
naturally occurring amino acid residues; or c) residues which induce secondary 
structural mimicry, i.e., to induce or stabilize a secondary structure, e.g., a beta turn, 
gamma turn, beta sheet, alpha helix conformation, and the like. A polypeptide can be 
characterized as a mimetic when all or some of its residues are joined by chemical 

15 means other than natural peptide bonds. Individual peptidomimetic residues can be 
joined by peptide bonds, other chemical bonds or coupling means, such as, e,g.^ 
glutaraldehyde, N-hydroxysuccinimide esters, bifimctional maleimides, NJJ'- 
dicyclohexylcarbodiimide (DCC) or N,N'-diisopropylcarbodiimide (DIQ. Linking 
groups that can be an alternative to the traditional amide bond C^eptide bond**) 

20 linkages include, ketomethylene (e.g., -C{=0)-CH2- for -C(=0)-NH-), 

aminomethylene (CH2-NH), ethylene, olefin (CH=CH), ether (CH2-O), fliioether (CH2- 
S), tetrazole (CN4), thiazole, retroamide, or ester (see, e.g., Spatola, Chemistry and 
Biochemistry of Amino Acids, Peptides and Proteins, Vol. 7, pp 267-357, "Peptide 
Backbone Modifications," MarcellDekker, NY (1983)). A polypeptide can also be 

25 characterized as a mimetic by containing all or some non-natural residues in place of 
naturally occurring amino acid residues; non-natural residues are well described in the 
scientific and patent literature. 

A "label" or a "detectable moiety" is a composition detectable by spectroscopic, 
photochemical, biochemical, immunochemical, or chemical means. For example, useful 

30 labels include ^^P, fluorescent dyes, electron-dense reagents, enzymes (e.g., as 

commonly used in an ELISA), biotin, digoxigenin, or haptens and proteins which can 
be made detectable, e.g. , by incorporating a radiolabel into the pq)tide or used to detect 
antibodies specifically reactive with the peptide. 
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A "labeled nucleic acid piobe or oligonucleotide" is one that is bound, either 
covalently, through a linker or a chemical bond, or noncovalently, Enough ionic, van 
der Waals, electrostatic, or hydrogen bonds to a label such that the presence of the 
probe may be detected by detecting the presence of the label bound to the probe. 

5 As used herein a '"nucleic add probe or oligonucleotide" is defined as a nucleic 

acid capable of binding to a target nucleic acid of^complementary sequence through one 
or more types of chemical bonds, usually through complementary base pairing, usually 
through hydrogen bond formation. As used herein, a probe may include natural (i.e., A, 
G, C, or T) or modified bases (7-deazaguanosine, inosine, eta). In addition, the bases in 

10 a probe may be joined by a linkage other than a phosphodiester bond, so long as it does 
not interfere with hybridization. Thus, for example, probes may be peptide nucleic ^ 
acids in which the constituent bases are joined by peptide bonds rather than 
phosphodiester linkages. It will be understood by one of skill in the art tiiat probes may 
bind target sequences lacking complete complementarity with the probe sequence 

15 depending upon the stringency ofthe hybridization conditions. The probes are 

optionally directiy labeled as with isotopes, chromophores, lumiphores, chromogens, or 
indirectly labeled such as with biotin to which a streptavidin complex may later bind. 
By assaying for the presence or absence ofthe probe, one can detect the presence or 
absence of the select sequence or subsequence. 

20, The term "heterologous" when used wifli reference to portions of a nucleic acid 

indicates that the nucleic acid comprises two or more subsequences that are not found 
in the same relationship to each other in nature. For instance, the nucleic acid is 
typically recombinanfly produced, havinjg two or more sequences fi'om unrelated genes 
arranged to make a new functional nucleic acid, e.g., a promoter firom one source and a 

25 coding region from another source. Sinailarly, a heterologous protein indicates that the 
protein comprises two or more subsequences that are not found in the same relationship 
to each other in nature (e.g. , a fiision protein). 

A "promoter** is defined as an array of nucleic acid sequences that direct 
transcription of a nucleic acid. As used herein, a promoter includes necessary nucleic 

30 acid sequences near the start site of transcription, such as, in the case of a polymerase 11 
type promoter, a TATA element. A promoter also optionally includes distal enhancer 
or repressor elements, which can be located as much as several thousand base pairs 
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&om the start site of transcription. A "constitutive*'promoter is a pnffiioler that is active 
under most myiromnental and developmental conditions. 

An "inducible" promoter is a promoter that is active under environmental or 
developmental regulation. The term "operably linked" refers to a functional linkage 
5 between a nucleic acid expression control sequence (such as a promoter, or array of 
transcription factor binding sites) and a second nucleic acid sequence, wherein tiie 
e3q)ression control sequence directs transcription of the nucleic acid corresponding to 
the second sequence. 

As used herein, 'Recombinant" refers to a polynucleotide synthesized or 
10 otherwise manipulated in vitro (e.g., "recombinant polynucleotide"), to methods of 

using recombinant polynucleotides to produce gene products in cells or other biological 
systems, or to a polypeptide ("recombinant protein") encoded by a recombinant 
polynucleotide. "Recombinant means" also encon5)ass the ligation of nucleic acids 
haiving various coding regions or domains or promoter sequences from different 
1 5 sources into an expression cassette or vector for expression of, e.g., inducible or 
constitutive expression of a fusion protein comprismg a translocation domain of the 
invention and a nucleic acid sequence amplified using a primer of tiie invention. 

As used herein, a "stable cell line" refers to a cell line, which stably, i.e. over a 
prolonged period, expresses a heterologous nucleic sequence, i.e. a TIR or G protein. 
20 In preferred embodiments, such stable cell lines will be produced by transfecting 
appropriate cells, typically mammalian cells, e.g. HEK-293 cells, witii a linearized 
vector that contains a TIR expression construct, i.e. TlRl, T1R2 and/or T1R3. Most 
preferably, such stable cell lines will be produced by co-transfecting two linearized 
plasmids that express hTlRl and hTlR3 or hTlR2 and hTlR3 and an appropriate 
25 selection procedure to generate cell lines having these genes stably integrated therein. 
Most preferably, the cell line will also stably express a G protein such as Gais. 

the phrase "selectively (or specifically) hybridizes to" refers to the binding, 
duplexing, or hybridizing of a molecule only to a particular nucleotide sequence under 
stringent hybridization conditions when that sequence is present in a complex mixture 
30 (e.g., total cellular or UbraiyDNA or RNA). 

The phrase "stringent hybridization conditions" refers to conditions under 
which a probe will hybridize to its target subsequence, typically in a complex mixture 
of nucleic acid, but to no other sequences. Stringent conditions are sequence dependent 
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and wiU be different in different circumsto^ Longer sequences hybridize 
specij&cally at higher temperatures. An extensive guide to the hybridization of nucleic 
acids is found in Tijssen, Techniques in Biochemistry and Molecular Biology - 
Hybridization with Nucleic Probes, "Overview of principles of hybridization and the 

5 strategy of nucleic acid assays" (1993). Generally, stringent conditions are selected to 
be about 540° C lower than the thermal melting point (Tm) for the specific sequence at 
a defined ionic strength pH. The Tm is the temperature (under defined ionic strength, 
pH, and nucleic concentration) at which 50% of the probes complementary to the target 
hybridize to the target sequence at equilibrium (as the target sequences are present in 

10 excess, at Tm, 50% of the probes are occupied at equilibrium). Stringent conditions 
will be those in which the salt concentration is less than about 1.0 M sodium ion, 
typically about Q.Ol to 1 .0 M sodium ion concentration (or other salts) at pH 7.0 to 8.3 
and the temperature is at least about 30** C for short probes (eg., 10 to 50 nucleotides) 
anid at least about 60° C for long probes (e.g., greater than 50 nucleotides). Stringent 

15 conditions may also be achieved with the addition ofdestabilizing agents such as 
formamide. For selective or specific hybridization, a positive signal is at least two 
times background, optionally 10 times background hybridization. Exemplary stringent 
hybridization conditions can be as following: 50% formamide, 5x SSC, and 1% SDS, 
incubating at 42°C, or, 5x SSC, 1% SDS, incubating at 65°C, with wash in 0.2x SSC, 

20 and 0. 1 % SDS at 65''C. Such hybridizations and wash steps can be carried out for, e.g., 
1, 2, 5, 10, 15, 30, 60; or more minutes. 

Nucleic acids that do not hybridize to each other under stringent conditions are 
still substantially related if the polypeptides that they encode are substantially related. 
This occurs, for example, when a copy of a nucleic acid is created using the maximum 

25 codon degeneracy permitted by the genetic code. In such cases, the nucleic acids 
typically hybridize under moderately stringent hybridization conditions. Exemplary 
**moderately stringent hybridization conditions" include a hybridization in a buffer of 
40% formamidei 1 M NaCI, 1% SDS at 37°C, and a wash in IX SSC at 45*^C. Such 
hybridizations and wash steps can be carried out for, e.g., 1, 2, 5, 10, 15, 30, 60, or 

30 more minutes. A positive hybridization is at least twice background. Those of 

ordinary skill will readily recognize that altemative hybridization and wash conditions 
can be utilized to provide conditions of similar stringency. 
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"Antibody" refers to a polypeptide comprising a framework region from an 
immunoglobulin gene or fragments thereof that specifically binds and recognizes an 
antigen. The recognized immunoglobulin genes include the kappa, lambda, alpha, 
gamma, delta, epsilon, and mu constant region genes, as well as the myriad 
5 immunoglobulin variable region genes. Light chains are classified as either kappa or 
lambda. Heavy chains are classified as gamma, mu, alpha, delta, or epsilon, which in 
turn define the immunoglobulin classes, IgG, IgM, IgA, IgD and IgE, respectively. 

An exemplary immunoglobulin (antibody) structural unit comprises a tetramer. 
Each tetramer is composed of two identical pairs of polypeptide chains, each pair 
10 having one "ligjht" (about 25 kDa) and one "heavy** chain (about 50-70 kDa). The N- 
terminus of each chain defines a variable region of about 100 to 1 10 or more amino 
acids primarily responsible for antigen recognition. The terms "variable light chain" 
(VL) and "variable heavy chain" (VH) refer to these light and heavy chains 
respectively. 

15 A "chimeric antibody** is an antibody molecule m which (a) the constant region, >^ 

or a portion thereof, is altered, replaced or exchanged so that the antigen binding site - 
(variable region) is linked to a constant region of a different or altered class, effector 
fimction and/or species, or an entirely different molecule which confers new properties 
to the chimeric antibody, e.g., an enzyme, toxin, hormone, growth factor, drug, etc.; or • 
20 (b) the variable region, or a portion thereof, is altered, replaced or exchanged with a 
variable region havmg a different or altered antigen specificity. 

An "anti-TlR" antibody is an antibody or antibody firagment that specifically 
binds a polypeptide encoded by a TIR gene, cDNA, or a subsequence thereof. 

The term "immunoassay** is an assay that uses an antibody to specifically bind 
25 an antigen. The immunoassay is characterized by the use of specific binding properties 
of a particular antibody to isolate, target, and/or quantify the antigea 

The phrase "specifically (or selectively) binds** or "specifically (or selectively) 
reacts with,** when referring to a molecule or composition, refers to a binding reaction 
that is determinative of the presence of the molecule m a heterogeneous population of 
30 other biologies. Thus, under designated conditions, the specified molecules bind to a 
particular receptor at least two times tiie background and do not substantially bind in a 
significant amount to other molecules present in the sample. Specific binding to a 



88 



wo 2005/015158 PCT/US2004/025459 

receptor under such conditions may require a receptor that is selected for its specificity 
for a particular molecule. 

Regarding antibodies, a variety of immunoassay formats may be used to select 
antibodies specifically immunoreactive with a particular protein. For example, solid- 

5 phase ELISA immunoassays are routinely used to select antibodies specifically 
immunoreactive with a protem {see, e.g. , Harlow & Lane, Antibodies, A Laboratory 
Manual, (1988), for a description of immunoassay formats and conditions that can be 
used to determine specific immunoreactivity). Typically a specific or selective reaction 
will be at least twice background signal or noise and more typically more than 1 0 to 

10 100 times background. 

The phrase "selectively associates with" refers to the abiUty of a nucleic acid to 
"selectively hybridize" with another as defined above, or the ability of an antibody to 
"selectively (or specifically) bind to a protein, as defined above. 

The term "expression vectof refers to any recombinant expression system for 

1 5 the puipose of expressing a nucleic acid sequence of the mvention in vitro or in vivo^ 
constitutively or inducibly, in any cell, including prokaryotic, yeast, fimgal, plant, 
insect or mammalian cell. The term includes linear or circular expression systems. The* 
term includes expression systems that remain episomal or integrate into the host cell • 
genome. The expression systems can have the ability to self-replicate or not, i.e., drive 

20 only transient expression in a cell. The term mcludes recombinant expression 

"cassettes which contain only the minimum elements needed for transcription of the 
recombinant nucleic acid. 

By 'liost cell" is meant a cell that contains an expression vector and supports 
the replication or expression of the expression vector. Host cells may be prokaryotic 

25 cells such as E. coli, or eukaryotic cells such as yeast, insect, amphibian, worm or 
mammalian cells such as CHO, Hela, HEK-293, and the like, e.g., cultured cells, 
explants, and cells in VIVO. 

Compounds 

As discussed above, there are different domains on the TIR receptors. TlRl, 
30 T1R2, and T1R3 each contain an N-teniunal extracellular domain (also known as the 
Venus flytr^ domain), transmembrane domains comprising seven transmembrane 
regions, and corresponding cytoplasmic, and extracellular loops; a cysteine rich 
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domain, and a C-terminal domain. Each region defines a specific set of compounds tiiat 
bind specifically to that region. 

In humans, the N-terminal extracellular domam comprises amino acids 1 to 560 
of hTlR2 and amino acids 1 to 563 of hTlR3. In rats, the N-terminal extracellular 

5 domain comprises amino acids 1 to 564 of rTlR2, and amino acids 1 to 568 of rTlR3. 
In humans, the C-terminal transmembrane domain and intracellular domain 
comprise amnio acids 561 to 839 of liTlR2, and amino acids 564 to 852 of hTlR3. In 
rats, the C-tenninal transmembrane domain and intracellular domain comprise amino 
acids 565 to 842 of i'TlR2, and amino acids 569 to 858 of rTlR3. 

1 0 Metabotropic glutamate receptors (mGluR) are another class of C-class G 

protein-coupled receptors that respond to glutamate. These are found primarily in the 
brain and neuronal tissue where they play a role in neuronal signaling. The mOluR N- 
terminal extracellular douMin can be covalently linked to a TIR in order to create 
' chimeric receptors. The mOluR receptor can be any of mGluRl-mOluRS, for examples 

15 Different ligands bind to different domains on different subunits of both the mnami and 
the sweet receptors. For example, aspartame and neotame bind to the N-terminal 
extracellular domain of T1R2, while cyclamate, neohesperidin dihydrochalcone 
(NHDC), and lactisole bind to the transmembrane domain of T1R3. Because T1R3 is 
one of the two subunits in the T1R1/T1R3 umami taste receptor, cyclamate, NHDC and 

20 lactisole can interact with T1R3 in the T1R1/T1R3 umami taste receptor as well. 

Cyclamate and NHDC enhance the activity of the umami taste receptor, while lactisole' 
inhibits the umami receptor. 

The specific binding compounds of the invention as it relates to umami tastants 
comprise amides. The amide compounds also comprise certain sub-classes of amide 

25 derivatives or classes of derivatives related to amides, such as for example ureas, 
urethanes, oxalamides, acrylamides, and the like. 

Molecules that interact with the t ansmembrane domain of T1R2, for example, 
can be modulators of sweet taste, and molecules that mteract with the transmembrane 
domain of T1R3 can be modulators of sweet taste and/or umami taste. 

30 Human T1R2/T1R3 recognizes a group of sweeteners which are not recognized by rat 
T1R2/T1R3, and human but not rat T1R2/T1R3 is inhibited by lactisole. Wlien the 
extracellular domain of human T1R2 was replaced by its rat coimterpart, the human 
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receptor lost the ability to recdgnize aspartame, indicating that this part of human T1R2 
is required for binding to aspartame. Inversely, when the extracellular domain of rat 
T1R2 was replaced by its human counterpart, the rat receptor acquired the ability to 
recognize aspartame, indicating that this part of the human T1R2 is sufficient to bind 
5 aspartame. By the same principle, the transmembrane domain of human T1R3 was 
required and sufficient for 

Table 6 shows the abbreviations used to represent various rat/human chimeric 
receptors and receptor subunits. 
TABLE6 

hTlR2- human T1R2 , ' 

hTlR3- human T1R3 ] 

rTlR2-ratTlR2 , 

rTlR3-ratTlR3 

hTlR2/iT lR3 - a receptor composed of human T1R2 and rat T1R3 

rTlR2/hTlR3 - a receptor composed of a rat T1R2 and human T1R3 

hTlR2/h3-r3 - a receptor composed of human T1R2 and a chimeric T1R3 with human N- 
terminal extracellular domain and rat transmembrane and C-terminal domain 

rTlR2/r3-h3 - a receptor composed of rat T1R2 and a chimeric T1R3 with rat N-termin^ 
extracellular dornain and human transmembrane and C-tenninal domain 

h2-r2/rTlR3 - a receptor composed of a chimeric T1R2 with human N-terminal extracellular 
domaiii and rat transmembrane and C-terminal domain and rat T1R3 

r2-h2/rTlR3 - a receptor composed of a chimeric T1R2 with rat N-terminal extracellular 
domain and human transmembrane and C-terminal domain and rat T1R3 

h2-hl/hTlR3 - a receptor composed of a chimeric TIR wifli human T1R2 N-terminal 
extracellular domain and human TlRl transmembrane and C-terminal domain and human 
T1R3 

hl-h2/hTlR3 - a receptor composed of a chimeric TIR with human TlRl N-terminal " 
extracellular domain and human T1R2 transmembrane and C-terminal domain and human 
T1R3 



h2-mGluRl/h3-mGluRl ~ a receptor composed of a N-terminal extracellular domain from 
hTlR2 covalently linked to the transmembrane and C-terminal domain of mGluRl and a 
N-tenninal extracellular domain from hTlR3 covalently linked to the transmembrane and 
C-terminal domain of mGluRl 

hl-mGlulR/h3-mGluRl - a receptor composed of a N-tenninal extracellular domain from 
hTlRl covalently linked to the transmembrane and C-tenninal domain of mGluRland a 
N-terminal extracellular domain from hTlR3 covalently linked to the transmembrane and 
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C-tenninal domain of mGluRl 

mGluRl-h2/mGluRl-h3 - a receptor composed of a N-terminal extracellular domain from 
mGluRl covalently linked to the transmembrane and C-tenninal domain of hTlR2 and a 
N-terminal extracellular domain from a mGluRl covalently linked to the transmembrane 
and C-terminal domain of hTlR3 



mGluRl-hl/mGluRl-h3 ~ receptor composed of a N-terminal extracellular domain from 
mGluRl covalently linked to the transmembrane and C-terminal domain of hTlRl and a 
N-terminal extracellular domain from mGluRl covalently linked to the transmembrane and 
C-terminal domain of hTlR3 



Disclosed herein are non-naturally occurring compoimds that specifically bind 
to the T1R2/T1R3 receptor comprising hTlR2/hTlR3 but not rTlR2/rTlR3. Examples 
of such compounds include, but are not limited to neptame, aspartame, cyclamate, 
5 lactisol, Compound 883360, Compound 6542888, Compound 403249, Compound 
6364395, Dihydroxybenzoic acid (DHB), Compound 6542888, and neohesperidine 
dihydrochalcone (NHDC) Additional examples are found in Tables 1-4. The organic, 
non-peptide compounds can be approximately the size of a box of dimensions 15x8x8 
angstroms, more preferably the dimension should be 12x5x5 angstroms. 
10 Also disclosed are compounds that specifically bmd to a TlR2/riR3 receptor 

comprising hTlR2/rTlR3 but not rTlR2/hTIR3. Examples of such compounds 
include, but are not limited to aspartame, and neotame. Additional examples are found 
in Table 5. 

Also disclosed are compounds that specifically bind to the N-terminal 
15 extracellular domain of T1R2 of the hTlR2/hTlR3 receptor. Examples of such 

compounds include, but are not limited to neotame, aspartame carbohydrate sugars (e.g. 
sucrose, fiuctose, glucose, tagatose, erythritol, sorbitol, maltose, xylitol, lactose and 
galactose, as well as all other carbohydrate sugars). Additional examples are found in 
Tables. 

20 Also disclosed are compounds that specifically bind to the Venus Flytrap 

Domain (VFD) of TIR2 of the hTlR2/hTlR3 and hTlR2/rTlR3 receptor. 

Also disclosed are compounds that specifically bind to the N-terminal Venus 
flytrap domain of the T1R2 subunit of the T1R2/T1R3 receptor. More specifically, also 
disclosed are compounds that specifically bind to amino acid residues 144 and 302 of 

25 the hun^nN-tenninal Venus flytrap domain oftheTlR2 subunit of aeTIR2/^ 
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receptor. Examples of such compoimds include, but are not limited to aspartame, 
neotame, carbohydrates, and sweet amino adds, such as D-Tip, Ala, and Gly. 

Also disclosed are compounds that specifically bind to the cysteine-rich region 
of T1R2 of the hTIR2/hTlR3 receptor. Also disclosed are compounds that specifically 

5 bind to the Transmembrane Domain (TM) of T1R2 of the hTlR2/hTlR3 receptor. 

Also disclosed are compounds that specifically bind to a T1R2/T1R3 receptor 
comprising rTlR2/hTlR3 but not hTlR2/rTlR3. Examples of such compounds 
include, but are not limited to cyclamate, NHDC, lactisole, Compound 883360, 
Compound 403249, and Compound 6364395. Additional examples are found in Table 

0 5. 

Also disclosed are compounds that specifically bind to hTlR2/hTlR3 and 
rTlR2/r3-h3 but not to rTlR2/rTlR3 or to hTlR2/h3-r3. Examples of such conq)ounds 
mclude, but are not limited to cyclamate, NEIDC, lactisole. Compound 883360, 
Compotmd 403249 and Compound 6364395. 
5 Also disclosed are compounds that specifically bind to extracellular loop 2 and 

extracellular loop 3 of the human C-traninal domain of the T1R3 subunit of 
theTlR2/TlR3 receptor. Also disclosed are compounds that specifically bind to 
hTlR2/hTlR3 and r2-h2/iTlR3 but not to rTlR2/iTlR3 or to h2-r2/hTlR3. 

Also disclosed are compounds that specifically bind to the human N-terminal 
0 extracellular domain of the T1R3 subunit of the T1R2/T1R3 receptor. Also disclosed 
, are compounds that specifically bind to the Venus Flytrap Domain (VFD) of T1R3 of 
the hTlR2/hTlR3 receptor. Examples of such compounds include, but are not limited 
to aspartame, neotame, carbohydrates, and sweet amino acids, such as D-Trp, Ala, and 
Gly. 

5 Also disclosed are compounds that specifically bind to the Transmembrane 

Domain of T1R3 of the hTlR2/hTlR3 recq)tor. Also disclosed are compounds that 
specifically bind to extracellular loop 2 and extracellular loop 3 of the human 
transmembrane domain of the T1R3 subunit of T1R2/T1R3. Examples of such 
compounds include, but are not limited to cyclamate. 

0 The compound of the invention does not include sucrose, fiiictose, glucose, 

erythritol, isomalt, lactitol, mannitol, sorbitol, xylitol, certain known natural terpenoids, 
flavonoids, or protein sweeteners, di-peptides, tri-peptides, aspartame, saccharin, 
sucralose, halogenated saccharides, acesulfame-K, cyclamate, sucralose, and alitame. 
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neotame, periUartine, SC-45647, SC-40014, moneUin, NC-002740-01, thaumatin, CC- 
00100, NC-00420, alitame, SC-44102, dulcin, NC-00576, slycyrrhizic Acid, stevioside, 
Na-Saccharin, D-tryptophan, cyclamate, DHB, glycolic Acid, glycine, D (-)fructose, 
homofuronol, D (-) tagatose, maltose, D (+) glucose, D-sorbitol, D (+) galactose, a- 
5 lactose, LQfructose, L (+) Compound 403249, and glucose. 

Optionally, a compound of the inyention is also not Compound 6364395. 
Also disclosed herein are compounds that bind a truncated region of a TIR 
domain. For example, disclosed are compounds that specifically bind to the TM domain 
of T1R2 of a truncated sweet receptor comprising h2TM/h3TM, compounds that 

10 specifically bind to the TM domain of T1R3 of a truncated sweet receptor comprising 
h2TM/h3TM, compounds that specifically bind to the TM domain of T1R2 of a 
chimeric receptor comprising mGluR-h2/mGluR-h3, compounds that specifically bind 
to the TM domain of T1R3 of a chimeric receptor comprising mGluR-h2/mGluR-h3, 
compounds that binds to the TM domain of TlRl of a truncated savory receptor 

15 comprising hi TM/h3TM, compound that binds to the TM domain of T1R3 of a 
truncated sweet receptor comprising hi TM/h3TM, compounds that bind to the TM 
domain of TlRl of a chimeric receptor comprising mGluR«hl/mGluR-h3, and 
compounds that bind to the TM domain of T1R3 of a chimeric receptor comprising 
mGluR-hl/mGluR-h3. SEQ ID NOS: 29-33 represent these truncated receptors. 

20 The compounds of the invention do not include monosodium glutamate 

C*MSG"), inosine monophosphate (IMP) or guanosine monophosphate (GMP), sucrose, 
fructose, glucose, erythritol, isomalt, lactitol, mannitol, sorbitol, xylitol, certain known 
natural terpenoids, flavonoids, or protein sweeteners, di-peptides, tri-peptides 
aspartame, saccharin, sucralose, halogenated saccharides, acesul&me-K, cyclamate, 

25 sucralose, alitame, monosodium glutamate CTStfSG'*)* inosine monophosphate (IMP) or 
guanosine monophosphate (GMP), or adenosine monophospate. 

Compound 403249 is (5-(4H-benzo[d][l,3]oxathiin-2-yl)-2-methyoxyphenol, 
while Compound 6364395 is 3-(3-hydroxy-4-methoxyphenethyl)benzo[d]isoxyazole- 
4,6-dioL 

30 The compounds described above can dernonstrate a compound-dependent 

increase in fluorescence with an activity compared to the maximal activity for fructose 
of at least 25% in a fluorescence-based assay using a FLIPR instrument (Fluorometric 
Intensity Plate Reader, Molecular Devices, Sunnyvale, CA). For examples of this 
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protocol, see Examples 12 and 18. The compounds can also demonstrate a compomid- 
dependeat decrease in ttie EC50 for a sweetener by at least two-fold in a fluorescence- 
based assay using a FLIPR (Molecular Devices) instrument Furthermore, in a cell- 
based assay, the compound can result in at least 10 out of 100 cells transfected with 

5 Avild-type or chimeric receptor showing a compound-dependent increase in 

fluorescence. An example of a cell-based assay can be found in Example 24. The 
compound can also demonstrate a compound-dependent increase of at least 1.1, 1.2, 
1.3, 1.4, 1.5, 1.6, 1.7, 1.8, 1 .9, two-fold or greater, or any number in between, in the 
number of fluorescent cells in response to a sub-maximal level of a sweetener. The 

10 response can be measured by fluorescence, calcium levels, IP3 levels, cAMP levels, 
GTPyS binding, or reporter gene activity (e.g. luciferase, beta-galactosidase). 

Furthennore, the compounds disclosed herein can have one or more of the 
following characteristics in a cell: a decreased EC50 compared to a control of at least 
approximately 50%, increased intracellular Ca^* level by at least approximately 25%, 

1 5 increased intracellular cAMP by at least approximately 25%, increased intracellular 
cGMP by at least approximately 25%, increased intracellular IP3 by at least 
approximately 25%, or increased G protein bmding of GTPyS by at least approximately 
25%. 

20 Meilwds of Using the Compounds 

Also disclosed are methods modulating the savory taste of a comestible or 
medicinal product comprising providing at least one comestible or medicinal product, 
or a precursor thereof, and combining the comestible or medicinal product or precursor 
thereof with at least a savory flavor modulating amount of at least one non-naturally 

25 occurring compound as disclosed herein, or a comestibly acceptable salt thereof so as 
to form a modified comestible or medicinal product; thereby modulating the savory 
taste of a comestible or medicinal product. 

Also disclosed are methods for inhibiting the savory taste of a comestible or 
medicinal product comprising providing at least one comestible or medicinal product, 

30 or a precursor thereoi^ and combining ttie comestible or medicinal product or precursor 
thereof with at least a savory flavor inhibiting amount of at least one non-naturally 
occiuring compound as disclosed herein, or a comestibly acceptable salt thereof, so as 
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to fonn a modified comestible or medicinal product; thereby inhibiting the savory taste 
of a comestible or medicinal product 

Also disclosed are methods for increasing the savory taste of a comestible or 
medicinal product comprising providiag at least one comestible or medicinal product, 
5 or a precursor thereof, and combining the comestible or medicinal product or precursor 
thereof with at least a savory flavor increasing amount of at least one non-naturally 
occurring compound as disclosed herein, or a comestibly acceptable salt thereof, so as 
to form a modified comestible or medicinal product; thereby increasing the savory taste 
of a comestible or medicinal product 

10 

Also disclosed are methods for modulating the sweet taste of a comestible or 
medicinal product comprising providing at least one comestible or medicinal product, 
or a precursor thereof, and combining the comestible or medicinal product or precursor 
thereof with at least a sweet flavor modulating amount of at least one non-naturally 

15 occiuring compound as disclosed herein, or a comestibly acceptable salt thereof, so as 
to form a modified comestible or medicinal product; thereby modulating the sweet taste 
of a comestible or medicinal product 

Also disclosed are methods for inhibiting the sweet taste of a comestible or 
medicinal product comprising providing at least one comestible or medicinal product, 

20 or a precursor thereof and combining the comestible or medicinal product or precursor 
thereof with at least a sweet flavor inhibiting amount of at least one non-naturally 
occurring compound as disclosed herein, or a comestibly acceptable salt thereof, so as 
to form a modified comestible or medicinal product; thereby inhibiting the sweet taste 
of a comestible or medicinal product 

25 Also disclosed are methods for increasing the sweet taste of a comestible or 

medicinal product comprising providing at least one comestible or medicinal product, 
or a precursor thereof and combioing the comestible or medicinal product or precursor 
thereof with at least a sweet flavor increasing amount of at least one non-naturally 
occurring compound as disclosed herein, or a comestibly acceptable salt thereoi^ so as 

30 to form a modified comestible or medicinal product; thereby increasing the sweet taste 
of a comestible or medicinal product 

Also disclosed are methods of enhancing umami taste perception conq}rising 
contacting an umami receptor with cyclamate and NHDC, and their derivatives, as well 
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as methods of enhancing umami taste perception comprising contacting an umami 
receptor with lactisole derivatives. Also disclosed are methods of enhancing sweet taste 
perception comprising contacting an sweet receptor with cyclamate and NHDC, and 
their derivatives. Also disclosed are methods of enhancing sweet taste perception 
5 comprising contacting an sweet receptor with lactisole derivatives. 

Isolation and Expression of TIR Polypeptides 

Isolation and expression of the TlRs, or fragments or variants thereof, of the 
invention can be performed as described below. PGR primers can be used for the 
amplification of nucleic acids encoding taste receptor ligand-binding regions, and 

10 libraries of these nucleic acids can optionally be generated. Individual expression 
vectors or libraries of expression vectors can then be used to infect or transfect host 
cells for the functional e3q)ression of these nucleic acids or libraries. These genes and 
vectors can be made and expressed in vitro or in vivo. One of skill will recognize that 
desired phenotypes for altering and controlling nucleic acid expression can be obtained 

15 by modulating the expression or activity of the genes and nucleic acids (e.g, , promoters, 
enhancers and the like) within the vectors of the invention. Any of the known methods 
described for increasing or decreasing expression or activity can be used. The 
invention can be practiced in conjunction with any method or protocol known in the art, 
which are well described m the scientific and patent literature. 
20 The nucleic acid sequences of the invention and other nucleic acids used to 

practice this invention, whether KNA, cDNA, genomic DNA, vectors, viruses or 
hybrids thereof, may be isolated fiom a variety of sources, genetically engineered, 
amplified, and/or expressed recombinantly. Any recombinant expression system can be 
used, including, in addition to mammalian cells, e.g., bacterial, yeast, insect, or plant 
25 systems. 

Alternatively, these nucleic acids can be synthesized in vitro by well-known 
chemical synthesis techniques, as described in, e.g. , Carmthers, Cold Spring Harbor 
Symp. Quant BioL 47:41 1-418 (1982); Adams, Am. Chenu Soc. 105:661 (1983); 
Belousov, Nucleic Acids Res. 25:3440-3444 (1997); Frenkel, FreeRadic. BioL Med 
30 19:373-380 (1995); Blommers, Biochemistry 33:7886-7896 (1994); Narang, Metk 
EnzymoL 68:90 (1979); Brown, Metk Enzymol 68:109 (1979); Beaucage, Tetra. Lett 
22:1859 (1981); U.S. Patent No. 4,458,066. Double-stranded DNA fragments may 
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then be obtained either by synthesizmg the cK>mplementaiy st^ 

strands together under appropriate conditions, or by adding the complementaiy strand 

using DNA polymerase with an appropriate primer sequence. 

Techniques for the manipulation of nucleic acids, such as, for example, for 
5 generating mutations in sequences, subcloning, labeHng probes, sequencing, 

hybridization and the like are well described in the scientific and patent literature. See, 
e.g., Sambrook, ed., Molecular Cloning: a Laboratory manual (2nd ed.), Vols. 1-3, Cold 
Spring Harbor Laboratory (1989); Current Protocols in Molecular Biology, Ausubel, 
ed. John Wiley & Sons, Inc., New York (1997); Laboratory Techniques in 

1 0 Biochemistry and Molecular Biology: Hybridization With Nucleic Acid Probes, Part I, 
Theory and Nucleic Acid Preparation, Tijssen, ed Elsevier, N.Y. (1993). 

Nucleic acids, vectors, capsids, polypeptides, and the like can be analyzed and 
quantified by any of a number of general means well known to those of skill in the art. 
These include, analytical biochemical methods such as NMR, spectrophotometry, 

1 5 radiography, electrophoresis, capillary electrophoresis, high performance liquid 
chromatography (HPLC), thin layer chromatography (TLC), and hyperdiffiision 
chromatography, various immunological methods, fluid or gel precipitin reactions, 
immunodifEusion, immunoelectrophoresis, radioimmunoassays (RIAs), enzyme-linked 
immunosorbent assays (ELIS As), immuno-fluorescent assays. Southern analysis, 

20 Northern analysis, dot-blot analysis, gel electrophoresis (e.g., SDS-PAGE), RT-PCR, 
quantitative PCR, other nucleic acid or target or signal amplification methods, 
radiolabeling, scintillation counting, and affinity chromatography. 

Oligonucleotide primers may be used to amphfy nucleic acid firagments 
encoding taste receptor Ugand-binding regions. The nucleic acids described herein can 

25 also be cloned or measured quantitatively using amplification techniques. 

Amplification methods are also well known in the art, and include, e.g., polymerase 
chain reaction, PCR (PCR Protocols, a Guide to Methods arid Applications^ ed. Innis. 
Academic Press, N.Y. (1990) and PCR Strategies, ed. Iraiis, Academic Press, Inc., N.Y. 
(1995), ligase chain reaction (LCR) (see, e.g., Wu, Genomics 4:560 (1989); Landegren, 

30 Science 241:1077, (1988); Barringer, Gene 89:117 (1990)); transcription amplification 
(see, 6.g., Kwoh, Proc. Natl. Acad. Set USA 86:1173 (1989)); and, self-sustained 
sequence replication (see, e.g, Guatelli, Proa Natl. Acad. Sci. USA 87:1874 (1990)); Q 
Beta replicase anq)lification (see, e.g.. Smith, J. Clin. Microbiol. 35:1477-1491 (1997)); 
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automated Q-beta replicase amplification assay (see, e.g.. Burg, MoL Cell Probes 
10:257-271 (1996)) and other RNA polymerase mediated techniques (e.g., NASBA, 
Cangene, Mississauga, Ontario); see also Berger, Methods Enzymol 152:307-3 16 
(1987); Sambrook; Ausubel; U.S. Patent Nos. 4,683,195 and 4,683,202; Sooknanan, 
5 Biotechnology 13:563-564 (1995). The primers can be designed to retain the original 
sequence of the "donor" 7-membrane receptor. Alternatively, the primers can encode 
amino acid residues that are conservative substitutions (eg., hydrophobic for 
hydrophobic residue, see above discussion) or functionally benign substitutions (e.g., 
do not prevent plasma membrane insertion, cause cleavage by peptidase, cause 
10 abnormal folding of receptor, and the like). Once amplified, the nucleic acids, either 
individually or as libraries, may be cloned according to itnethods known in the art, if 
desired, into any of a variety of vectors using routine molecular biological methods; 
methods for cloning in vitro amplified nucleic acids are de^scribed, eg., U.S. Pat. No. 
5,426,039. 

1 5 The primer pairs may be designed to selectively amplify ligand-biudiag regions 

of the TIR family members. These regions may vary for different ligands or tastants. 
Thus» what may be a mmimal binding region for one tastant, maybe too limiting for a 
second tastant. Accordingly, ligand-binding regions of different sizes comprising 
different extracellular domain structures may be amplified 

20 Paradigms to design degenerate primer pairs are well known in the art. For 

example, a COnsensus-DEgenerate Hybrid Oligonucleotide Primer (CODEHOP) 
strategy computer program is accessible as ht^://blocks.flicrc.org/codehop.html, and is 
directly linked fi-om the BlockMaker multiple sequence aligmnent site for hybrid 
primer prediction beginning with a set of related protein sequences, as known taste 

25 receptor ligand-binding regions {see, eg.. Rose, Nucleic Acids Res. 26:1628-1635 
(1998); Singh, Biotechniques 24:318-319 (1998)). 

Means to synthesize oligonucleotide primer pairs are well known in the art. 
'"Natural" base pairs or synthetic base pairs can be used. For example, use of artificial 
nucleobases offers a versatile approach to manipulate primer sequence and generate a 

30 more complex mixture of amplification products. Various families of artificial 

nucleobases are capable of assuming multiple hydrogen bonding orientations through 
mtemai bond rotations to provide a means for degenerate molecular recognition. 
Incorporation of these analogs into a single position of a PGR primer allows for 
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generation of a complex library of amplification products. See, e.g.y Hoops, Nucleic 
Acids Res, 25:4866-4871 (1997). Nonpolar molecules can also be used to mimic the 
shape of natural DNA bases. A non-hydrogen-bonding shape mimic for adenine can 
replicate efficiently and selectively against a nonpolar shape mimic for thymine {see, 
5 Morales, Nat Struct Biol 5:950-954 (1998)). For example, two degenerate bases 

can be the pyrimidine base 6H, 8H-3,4-dihydropyrimido[4,5-c][l,2]oxazin-7-one or die 
purine base N6-methoxy-2,6-diaminopurine (see, e.g.. Hill, Proc, Natl, Acad. Sci. USA 
95:4258-4263(1998)). Exemplary degenerate primers of the invention incorporate the 
nucleobase analog 5'-Diriiethoxytrityl-N-benzoyl-2*-deoxy-Cytidine,3*-[(2- 
10 cyanoethyl)-(N,N-diisopropyl)]-phosphoramidite (the term *T" in the sequences, see 
above). This pyrimidine analog hydrogen bonds with purines, including A and G 
residues. 

Polymorphic variants, alleles^ and interspecies homologs that are substantially 
identical to a taste receptor disclosed herein can be isolated using the nucleic add 
15 probes described above. Alternatively, expression libraries can be used to clone TIR 
polypeptides and polymorphic variants, alleles, and interspecies homologs thereof, by 
detecting expressed homologs immunologically with antisera or purified antibodies 
made against a TIR polypeptide, which also recognize and selectively bind to the TIR 
homolog. 

20 Nucleic acids that encode ligand-binding regions of taste receptors may be 

generated by amplification (e.g,y PCR) of appropriate nucleic acid sequences using 
degenerate primer pairs. The amplified nucleic acid can be genomic DNA fix)m any 
cell or tissue or mRNA or cDNA derived fix)m taste receptor-expressing cells. 

In one embodiment, hybrid protein-coding sequences comprising nucleic acids 

25 encoding TlRs fiised to translocation sequences may be constructed. Also provided are 
hybrid TlRs comprising the translocation motifs and tastant-binding domains of other 
families of chemosensory receptors, particularly taste receptors. These nucleic acid 
sequences can be operably linked to transcriptional or translational control elements, 
e,g,, transcription and translation initiation sequences, promoters and enhancers, 

30 transcription and translation terminators, polyadenylation sequences, and other 
sequences usefixl for transcribing DNA into KNA. In constitutive of recombinant 
expression cassettes, vectors, and transgenics, a promoter fragment can be employed to 
direct expression of the desired nucleic acid in all desired cells or tissues. 
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111 another embodiment, fusion proteins may include C-terminal or N-tenninal 
translocation sequences. Further, fusion proteins can comprise additional elements, 
e.g., for protein detection, purification, or other tqjplications. Detection and 
purification facilitating domains mclude, e.g., metal chelating peptides such as 
5 polyhistidine tracts, histidine-tryptophan modules, or other domains that allow 
purification on immobilized metals; maltose binding protein; protein A domains that 
allow purification on immobilized immunoglobulin; or the domain utilized in the 
FLAGS extension/affinity purification system (Immunex Corp., Seattle, WA). 

The inclusion of a cleavable linker sequences such as Factor Xa (see, e.g., 
10 Ottavi, Biochimie 80:289-293 (1998)), subtiUsin protease recognition motif (jee. e.g., 
Polyak, Protein Eng 10:615-619 (1997)); enterokinase (Invitrogen, San Diego, CA), 
and the like, between the translocation domain (for efBcient plasma membrane 
expression) and the rest of the newly translated polypeptide may be usefiil to fedlitate 
purification. For example, one construct can include a polypeptide encoding a nucleic 
15 add sequence linked to six histidine residues followed by a thioredoxin, an 

enterokinase cleavage site {see. e.g., Williams, Biochemistry 34:1787-1797 (1995)), 
and an C-terminal translocation domain. The histidine residues fecilitate detection and 
purification while the enterokinase cleavage site provides a means for purifying the 
desired protein(s) &om the remainder of the fiision protein. Technology pertaining to 
20 vectors encoding fiision proteins and application of fiision proteins are well described 
in the scientific and patent literature, see, e.g., KroU, DMA Cell. Biol. 12:441-53 (1993). 

Expression vectors, either as individual expression vectors or as libraries of 
expression vectors, comprising the ligand-binding domain encoding sequences maybe 
introduced into a genome or into the cytoplasm or a nucleus of a cell and expressed by 
25 a variety of conventional techniques, well described in the scientific and patent 

Uterature. See. e.g., Roberts, Nature 328:731 (1987); Berger supra; Schneider, Protein 
Expr. Punf. 6435:10 (1995); Sambrook; Tijssen; Ausubel. Product infomiation ftom 
manufacturers of biological reagents and experimental equipment also provide 
information regarding known biological methods. The vectors can be isolated from 
30 natural sources, obtained &om sudi sources as ATCC or GenBank libraries, or prq)aied 
by synlbetic or recombinant methods. 

The nucleic acids can be expressed using e>q)ression cassettes, vectors or 
viruses which are stably or transiently expressed in cells ie.g, episomal expression 

lOi 
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systems). Selection markers can be incorporated into expression cassettes and vectors 
to confer a selectable phenotype on transformed cells and sequences. For example, 
selection markers can code for episomal n^iaintenance and replication such tiiat 
integration into the host genome is not required. For example, the marka: may encode 
5 antibiotic resistance (eg., chloramphenicol, kanamycin, G418, blasticidin, hygromycin) 
or herbicide resistance (e.g., chlorosulfuron or Basta) to permit selection of those cells 
transformed with the desired DNA sequences (see, e.g., Blondelet-Rouault, Gene 
190:315-317 (1997); Aubrecht, J. Pharmacol, Exp. Ther. 281:992-997 (1997)). 
Because selectable marker genes conferring resistance to substrates hke neomycin or 

1 0 hygromycin can only be utilized in tissue culture, chemoresistance genes are also used 
as selectable markers m vzTro and m vivo. 

A chimeric nucleic acid sequence may encode a TIR ligand-binding domain 
within any 7-transmembrane polypeptide. Because 7-transmembrane receptor 
polypeptides have similar primary sequences and secondary and tertiary structures, 

is structural domains (eg., extracellular domain, TM domains, cytoplasmic domain, eta) 
can be readily identified by sequence analysis. For example, homology modeling, 
Foiuier analysis and helical periodicity detection can identify and characterize the 
seven domains with a 7-transmembrane receptor sequence. Fast Fourier Transform 
(FFT) algorithms can be used to assess the dominant periods that characterize profiles 

20 of the hydrophobicity and variability of analyzed sequences. Periodicity detection 
enhancement and alpha helical periodicity index can be done as by, e.g., Donnelly, 
Protein Sci. 2:55-70 (1993). Other alignment and modeling algorithms are well known 
in the art, see, e.g., Peitsch, Receptors Channels 4:161-164 (1996); Kyte & Doolittle, J. 
Med. Bio., 157:105-132 (1982); Cronet, Protein Eng 6:59-64 (1993). 

25 The present invention also includes not only the DNA and proteins having the 

specified nucleic and amino acid sequences, but also DNA firagments, particularly 
fragments of, e.g., 40, 60, 80, 100, ISO, 200, or 250 nucleotides, or more, as well as 
protein fragments o^ e.g., 10, 20, 30, 50, 70, 100, or 150 amino acids, or more. 
Optionally, the nucleic acid fragments can encode an antigenic polypeptide, which is 

30 capable of binding to an antibody raised against a TIR fanuly member. Further, a 
protein fragment of tiie invention can optionally be an antigenic fragment, vdiich is 
capable of binding to an antibody raised against a TIR family member. 
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Also contemplated are chimeric proteins, comprising at least 1 0, 20, 30, 50, 70, 
100, or ISO amino acids, or more, of one of at least one of the TIR polypeptides 
described herein, coupled to additional amino acids rqpresmting all or part of another 
GPCR, preferably a member of the 7 transmembrane superfamily. These chimeras can 
5 be made jfrom the instant receptors and another GPCR, or they can be made by 

combining two or more of the present TIR receptors. In one embodiment, one portion 
of the chimera corresponds to or is derived from the extracellular domain of a TIR 
polypeptide of the invention, hi another embodiment, one portion of the chimera 
corresponds to, or is derived from the extracellular domain and one or more of the 

1 0 transmembrane domains of a TIR polypeptide described herein, and the remaining 
portion or portions can come from another GPCR. Chimeric receptors are well known 
in the art, and the techniques for creating them and the selection and boundaries of 
domains or fragments of G protein-coupled receptors for incorporation therein are also 
well known. Thus, this knowledge of those skilled in the art can readily be used to 

IS create such chimeric receptors. The use of such chimeric receptors can provide, for 
example, a taste selectivity characteristic of one of the receptors specifically disclosed 
herein, coupled with the signal transduction characteristics of another receptor, such as 
a well known receptor used in prior art assay systems. 

As noted above, such chimeras, analogous to the native TIR receptor, or native 

20 TIR receptor combination or association will bind to and/or be activated by molecules 
that normally affect sweet taste or umami taste. Functional chimeric TIR receptors or 
receptor combinations are molecules which when expressed alone or in combination 
with other TlRs or other GPCRs (which may themselves be chuneric) bind to or which 
are activated by taste stunuh, particularly sweet (T1R2/3) or umami taste stimuli 

25 (TlRl/3). Molecules that elicit sweet taste include natural and artificial sweeteners 
such as sucrose, aspartame, xyKtol, cyclamate, et al.. Molecules that elicit umami taste 
mclude glutamate and glutamate analogs and other compounds that bind to native TlRl 
and/or T1R3, such as S'-nucleotides. 

For example, a domain such as a ligand-bindmg domain, an extracellular 

30 domain, a transmembrane domain, a transmembrane domain, a cytoplasmic domain, an 
N-terminal domain, a C-terminal domain, or any combination thereof, can be 
covalently Unked to a heterologous protem. For mstance, an TIR extracellular domain 
can be linked to a heterologous GPCR transmembrane domain, or a heterologous 
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GPCR extracellular domain can be linked to a TIR transmembrane domain. Other 
heterologous proteins of choice can be used; e.g. , green fluorescent protein. 

Also within the scope of the invention are host cells for expressing the TlRs, 
fragments, chimeras or variants of the invention, To obtain high levels of expression of 
5 a cloned gene or nucleic acid, such as cDNAs encoding the TlRs, fragments, or 

variants of the invention, one of skill typically subclones the nucleic acid sequence of 
interest into an expression vector that contains a strong promoter to direct transcription, 
a transcription/translation terminator, and if for a nucleic acid encoding a protein, a 
ribosome binding site for translational initiation. Suitable bacterial promoters are well 

1 0 known in the art and described, e.g, , in Sambrook et al However, bacterial or 
eukaryotic expression systems can be used. 

Any of the well-known procedures for introducing foreign nucleotide sequences 
into host cells may be used. These include the use of calcium phosphate transfection, 
polybrene, protoplast fusion, electroporation, liposomes, microinjection, plasma 

15 vectors, viral vectors and any of the oflier well known methods for introducing cloned 
genomic DNA, cDNA, synthetic DNA or other foreign genetic material into a host cell 
{see, e.g., Sambrook et al) It is only necessary that the particular genetic engineering 
procedure used be capable of successfully introducing at lest one nucleic acid molecule 
into the host cell capable of expressing the TIR, fragment, or variant of interest. 

20 After the expression vector is introduced into the cells, the transfected cells are 

cultured under conditions favoring expression of the receptor, fragment, or variant of 
interest, which is then recovered from the culture using standard techniques. Examples 
' of such techniques are well known in the art. See, e.g., WO 00/06593, which is 
incorporated by reference in a manner consistent with this disclosure. 

25 

Detection of TIR polypeptides 

In addition to the detection of TIR genes and gene expression using nucleic 
acid hybridization technology, one can also use immunoassays to detect TlRs, e.g., to 
identify taste recq)tor cells, and variants of TIR family members. Immunoassays can 
30 be used to qualitatively or quantitatively analyze the TlRs. A g^eral overview of the 
applicable technology can be found in Harlow & Lane, Antibodies: A Laboratory 
Manual imS). 
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1. Antibodies to TIR family members 

Methods of producing polyclonal and monoclonal antibodies tihat react 
specifically with a TIR family member are known to those of skill in the art {see, e.g., 
Coligan, Current Protocols in Immunology (1991); Harlow & Lane, supra; Goding, 
5 Monoclonal Antibodies: Principles and Practice (2d ed. 1 986); and Kohler & Milstein, 
Nature, 256:495-497 (1975)). Such techniques include antibody preparation by 
selection of antibodies Jfrom Ubraries of recombinant antibodies in phage or similar 
vectors, as well as preparation of polyclonal and monoclonal antibodies by immunizing 
rabbits or mice (5ee, e.g., Huse a/., iScfence, 246:1275-1281 (1989); Ward a/., 

10 A^a/Mr^, 341:544-546 (1989)). 

A number of TlR-comprising immunogens maybe used to produce antibodies 
specifically reactive with a TIR family member. For example, a recombinant TIR 
polypeptide, or an antigenic fragment thereof, can be isolated as described herein. 
Suitable antigenic regions include, e.g., the consensus sequences that are used to 

15 identify members of the TIR family. Recombinant proteins can be expressed in 

eukaryotic or prokaryotic cells as described above, and purified as generally described 
above. Recombinant protein is the preferred immunogen for the production of 
monoclonal or polyclonal antibodies. Alternatively, a synthetic peptide derived fix)m 
die sequences disclosed herein and conjugated to a carrier protein can be used an 

20 immunogen. Naturally occurring protein may also be used either in pure or impure 
form. The product is then injected into an animal capable of producing antibodies. 
Either monoclonal or polyclonal antibodies may be generated, for subsequent use in 
immunoassays to measure the protein. 

Methods of production of polyclonal antibodies are known to those of skill in 

25 the art. For example, an inbred strain of mice (e.g., BALB/C mice) or rabbits is 

immunized with the protein using a standard adjuvant, such as Freund's adjuvant, and a 
standard immunization protocol The animal's immune response to the immunogen 
preparation is monitored by taking test bleeds and detennining the titer of reactivity to 
the TIR. When appropriately high titers of antibody to the immunogen are obtained, 

30 blood is collected from the animal and antisera are prepared. Further fractionation of 
the antisera to enrich for antibodies reactive to ttie protein can be done if desired {see 
Harlow & Lane, supra). 
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Monoclonal antibodies xoscy be obtained by various techniques familiar to those 
skilled in the art. Briefly, spleen cells from an animal immunized with a desired 
antigen may be immortalized, commonly by fusion with a myeloma cell (see Kohler & 
Milstein, Eur. J. Immunol^ 6:51 1-519 (1976)). Alternative methods of immortalization 
5 include transformation with Epstein Barr Virus, oncogenes, or retroviruses, or other 
methods well known in the art. Colonies arising from single immortalized cells are 
screened for production of antibodies of the desired specificity and affinity for the 
antigen, and yield of the monoclonal antibodies produced by such cells may be 
enhanced by various techniques, including injection into the peritoneal cavity of a 

1 0 vertebrate host. Alternatively, one may isolate DNA sequences which encode a 

monoclonal antibody or a binding fragment thereof by screening a DNA library from 
human B cells according to the general protocol outlined by Huse et aL, Science, 
246:1275-1281(1989). 

Monoclonal antibodies and polyclonal sera are collected and titered against the 

1 5 immunogen protein in an immunoassay, for example, a solid phase immunoassay with - 
the immunogen immobilized on a solid support. Typically, polyclonal antisera with a 
titer of 104 or greater are selected and tested for their cross reactivity against non-TlR 
polypeptides, or even other TIR family members or other related proteins from other 
organisms, using a competitive binding immunoassay. Specific polyclonal antisera and 

20 monoclonal antibodies will usually bind with a Kd of at least about 0. 1 mM, more 
usually at least about 1 pM, optionally at least about 0.1 pM or better, and optionally 
0.01 pM or better. 

Once TIR family member specific antibodies are available, individual TIR 
proteins and protein fragments can be detected by a variety of intmiunoassay methods. 
25 For a review of unmunological and immunoassay procedures, see Basic and Clinical 
Immunology (Stites & Tot eds., 7th ed, 1991). Moreover, the immunoassays of the 
present invention can b^ p^oimed in any of several configurations, which are 
reviewed extensively in Enzyme Immunoassay (Maggio, ed., 1980); and Harlow & 
Lme, supra. 

30 

2. Immunological binding assays 

TIR proteins, fragments, and variants can be detected and/or quantified using 
any of a number of well-recognized immunological binding assays (see, e.g., U.S. 
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Patents 4,366,241; 4,376,110; 4,517,288; and 4,837,168). For areview of the general 
immunoassays, see also Methods in Cell Biology: Antibodies in Cell Biology, volume 
37 (Asai, ed. 1993); Basic and Clinical Immunology (Stites & Terr, eds., 7th ed. 1991). 
Immunological binding assays (or immunoassays) typically use an antibody that 
5 specifically binds to a protein or antigen of choice (in this case a TIR family member 
or an antigenic subsequence thereof). The antibody {e.g,, anti-TlR) may be produced 
by any of a number of means well known to those of skill in the art and as described 
above. 

Immunoassays also often use a labeling agent to specifically bind to and label 

10 the complex formed by the antibody and antigen. The labeling agent may itself be one 
of the moieties comprising the antibody/antigen complex. Thus, the labeling agent may 
be a labeled TIR polypeptide or a labeled anti-TlR antibody. Alternatively, the 
labeling agent may be a third inoiety, such a secondary antibody that specifically binds 
to the antibody/TlR complex (a secondary antibody is typically specific to antibodies 

IS of the species fix>m which the first antibody is derived). Other proteins capable of 
specifically binding immunoglobulin constant regions, such as protein A or protein G 
may also be used as the label agent. These proteins exhibit a strong non-immunogenic 
reactivity with immunoglobulin constant regions fi"om a variety of species (see, e.g., 
Kronval et aL, J, Immunol, 111:1401-1406 (1973); Akerstrom etal,J, Immunol, 

20 135:2589-2542(1985)). The labeling agent can be modified with a detectable moiety, 
such as biotin, to which another molecule can specifically bind, such as streptavidin. A 
variety of detectable moieties are well known to those skilled in the art. 

Throughout the assays, incubation and/or washing steps may be required after 
each combination of reagents. Incubation stq)s can vary from about S seconds to 

25 several hours, optionally fix>m about S minutes to about 24 hours. However, the 
incubation time will depend upon the assay format, antigen, volume of solution, 
concentrations, and the like. Usually, the assays will be carried but at ambient 
temperature, although they can be conducted over a range of teaq)eratures, such as 
10^Cto40^C. 

30 

A. Non-comnetitive assav formats 
Immunoassays for detecting a TIR polypeptide in a sample may be either 
competitive or noncompetitive. Noncompetitive immunoassays are assays in which the 
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amount of antigen is directly measured. In one preferred "sandwich" assay, for 
example, the anti-TlR antibodies can be bound directly to a solid substrate on which 
they are immobilized. These immobilized antibodies then capture the TIR polypeptide 
present in the test sample. The TIR polypeptide is thus immobilized is then bound by 
5 labeling agent, such as a second TIR antibody bearing a label. Alternatively, the 
second antibody may lack a label, but it may, in turn, be bound by a labeled third . 
antibody specific to antibodies of the species fi-om which the second antibody is 
derived. The second or third antibody is typically modified with a detectable moiety, 
such as biotin, to which another molecule specifically binds, e.g., streptavidin, to 
10 provide a detectable moiety. 

B. Competitive assay formats 
In competitive assays, the amount of TIR polypeptide present in the sample is 
measured indirectly by measuring the amount of a known, added (exogenous) TIR 
15 polypeptide displaced (competed away) firom an anti-TIR antibody by the unknown 
TIR polypeptide present in a sanq)le. In one competitive assay, a known amount of 
TIR polypeptide is added to a sample and the sample is then contacted with an 
antibody fliat specifically binds to the TIR. The amount of exogenous TIR polypeptide 
bound to the antibody is inversely proportional to the concentration of TIR polypeptide 
present in the sample. In a particularly preferred embodiment, the antibody is 
itamobilized on a solid substrate. The amount of TIR polypeptide bound to the 
antibody may be determined either by measuring the amount of TIR polypeptide 
present in a TlR/antibody complex, or alternatively by measuring the amount of 
remaining uncomplexed protem. The amount of TIR polypeptide may be detected by 
providing a labeled TIR molecule. 

A hapten inhibition assay is another preferred competitive assay. In this assay 
the known TIR polypeptide is immobilized on a solid substrate. A known amount of 
anti-TIR antibody is added to the sample, and the sample is then contacted with the 
immobilized TIR. The amount of anti-TIR antibody bound to the known immobilized 
TIR polypeptide is inversely proportional to the amount of TIR polypeptide present in 
the sample. Again, the amount of immobilized antibody may be detected by detectmg 
either the iramobilized jfraction of antibody or the fraction of the antibody that remains 
iQ solution. Detection may be direct where the antibody is labeled or indirect by the 
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subsequent addition of a labeled moiety that specifically binds to the antibody as 
described above. 

C. Cross-reactivity determinations 
5 Immunoassays in the competitive binding format can also be used for cross- 

reactivity determinations. For example, a protein at least partially encoded by the 
nucleic acid sequences disclosed herein can be immobilized to a solid support. Proteins 
(eg-. , TIR polypeptides and homologs) are added to the assay that compete for binding 
of the antisera to the immobilized antigen. The ability of the added proteins to compete 
10 for binding of the antisera to the immobiUzed protein is compared to the ability of ttie 
TIR polypeptide encoded by the nucleic ^cid sequences disclosed herein to compete 
with itself. The percent cross-reactivity for the above proteins is calculated, using 
standard calculations. Those antisera with less than 10% cross-reactivity with each of 
the added proteins listed above are selected and pooled. The cross-reacting antibodies 

1 5 are optionally removed from the pooled antisera by immunoabsoiption with the added 
considered proteins, e,g., distantly related homologs. In addition, peptides comprising 
amino acid sequences representing conserved motifs that are used to identify members 
of the TIR family can be used in cross-reactivity determinations. 

The immimoabsorbed and pooled antisera are then used in a competitive 

20 binding immunoassay as described above to compare a second protein, thought to be 
perhaps an allele or polymorphic variant of a TIR family member, to the immunogen 
protein (f.e., TIR polypeptide encoded by the nucleic acid sequences disclosed herein). 
In order to make this comparison, the two proteins are each assayed at a wide range of ' 
concentrations and the amount of each protein required to inhibit 50% of the binding of 

25 the antisera to the immobiUzed protein is determined. If the amount of the second 
protein required to inhibit 50% of binding is less than 10 times the amount of flie 
protein encoded by nucleic acid sequences disclosed herein required to uihibit 50% of 
binding, then the second protein is said to specifically bind to the polyclonal antibodies 
generated to a TIR iminimogen. 

30 Antibodies raised against TIR conserved moti& can also be used to prepare 

antibodies that specifically bind only to GPCRs of the TIR family, but not to GPCRs 
from other fanulies. 
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Polyclonal antibodies that specifically bind to a particular member of the TIR 
family can be made by subtracting out cross-reactive antibodies using other TIR family 
members. Species-specific polyclonal antibodies can be made in a similar way. For 
example, antibodies specific to human TlRl can be made by, subtracting out antibodies 
5 that are cross-reactive wifli orfliologous sequences, e.g., rat TlRl or mouse TlRl. 

D. Other assay formats 
Westem blot (immunoblot) analysis is used to detect and quantify the presence 
of TIR polypeptide in the sample. The technique generally comprises separating 

1 0 sample proteins by gel electrophoresis on the basis of molecular weight, transferring 
the separated proteins to a suitable solid support, (such as a nitrocellulose filter, a nylon 
filter, or derivatized nylon filter), and incubating the sample with the antibodies that 
specifically bind the TIR polypeptide. The anti-TlR polypq)tide antibodies 
specifically bind to the TIR polypeptide on the solid siq)port These antibodies may be 

1 5 directiy labeled or alternatively may be subsequently detected using labeled antibodies - 
(eg., labeled sheep anti-mouse antibodies) that specifically bind to the anti-TIR 
antibodies. 

Oftier, assay formats include liposome immunoassays (UA), which use 
liposomes designed to bind specific molecules (e.g., antibodies) and release 
20 encapsulated reagents or markers. The released chemicals are then detected according . 
to standard techniques (see Monroe et al, Amer. Clin. Prod. Rev., 5:34-41 (1986)). 

E. Reduction of non-specific binding 
One of skill in the art will appreciate that it is often desirable to mmimize non- 
25 specific binding in immunoassays. Particularly, where the assay involves an antigen or 
antibody immobilized on a sohd substrate it is desirable to minimize the amount of 
non-specific binding to the substrate. Means of reducing such non-specific binding are 
well known to those of skill in the art. Typically, this technique involves coating flie 
substrate with a proteinaceous composition. In particular, protein compositions such as 
30 bovine serum albumin (BSA), nonfat powdered milk, and gelatin are widely used with 
powdered milk being most preferred. 

R Labels 
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The particular label or detectable groii|> used in tbe assay is not a critical aspect 
of tile invention, as long as it does not significantly interfere with the specific binding 
of the antibody used in the assay. The detectable group can be any material having a 
detectable physical or chemical property. Such detectable labels have been well 
5 developed in the field of immunoassays and, in general, most any label useful in such 
methods can be applied to the present invention. Thus, a label is any composition 
detectable by spectroscopic, photochemical, biochemical, immunochemical, electrical, 
optical, or chemical means. Useful labels in the present invention include magnetic 
beads DYNABEADSTM), fluorescent dyes (e.g,, fluorescein isothiocyanate, 

10 Texas red, rhodamine, and the like), radiolabels (e.g., ^H, ^'^C, ^^S), enzymes (e,g., 
horseradish peroxidase, alkaline phosphates and others commonly used in an BLISA), 
and colorimetric labels such as colloidal gold or colored glass or plastic beads ie.g., 
polystyrene, polypropylene, latex, ere). 

The label may be coupled directly or indirectly to the desired component of the 

IS assay according to methods well known in tiie art. As in(Ucated above, a wide variety - 
of labels may be used, witii the choice of label depending on sensitivity required, ease 
of conjugation with the compound, stability requirements, available instrumentation, 
and disposal provisions. 

Non-radioactive labels are often attached by indirect means. Generally, a ligand 

20 molecule (e.g., biotin) is covalently bound to the molecule. The ligand then binds to 
another molecules (e.g'., streptavidin) molecule, which is either inherently detectable or 
covalently bound to a signal system, such as a detectable enzyme, a fluorescent 
compound, or a chemiluminescent compound. The ligands and their targets can be 
used in any suitable combination with antibodies that recognize a TIR polypeptide, or 

25 secondary antibodies that recognize anti-TlR. 

The molecules can also be conjugated directly to signal generating conqpounds, 
by conjugation with an enzyme or fluorophore. Enzymes of interest as labels will 
primarily be hydrolases, particularly phosphatases, esterases and glycosidases, or 
oxidotases, particularly peroxidases. Fluorescent compounds include fluorescein and 

30 its derivatives, rhodamine and its derivatives, dansyl, umbelliferbne, etc. 

Chemiluminescent compounds include luciferin, and 2,3-dihydrophthalazinediones, 
e.g., luminol. For a review of various labeling or signal producing systems tiiat may be 
used, see U.S. Patent No. 4,391,904. 
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Means of detecting labels are well known to those of skill in the art. Thus, for 
exsmph, where the label is a radioactive label, means for detection include a 
scintillation counter or photographic jStoi as in autoradiography. Where the label is a 
fluorescent label, it may be detected by exciting the fluorochrome with the appropriate 
5 wavelength of light and detecting the resulting fluorescence. The fluorescence may be 
detected visually, by means of photographic fihn, by the use of electronic detectors 
such as charge-coupled devices (CCDs) or photomultipliersmd the hke. Similarly, 
enzymatic labels may be detected by providing the appropriate substrates for the 
enzyme and detecting the resulting reaction product. Finally simple colorimetric labels 

10 may be detected simply by observing the color associated with the label. Thus, m 
various dipstick assays, conjugated gold often appears pink, while various conjugated 
beads appear the color of the bead. 

Some assay formats do not require the use of labeled components. For instance, 
agglutination assays can be used to detect the presmce of the target antibodies. In this 

15 case, antigen-coated particles are agglutinated by samples comprising the target 

antibodies. In fliis foimat, none of the components need be labeled and the presence of 
the target antibody is detected by simple visual inspection. 

Detection of Modulators 

20 Compositions and methods for determining whether a Jest compound 

specifically binds to a TIR receptor of the invention, both in vitro and in vivo, are 
described below. Many aspects of cell physiology can be monitored to assess the effect 
of ligand binding to a TIR polypeptide of the invention. These assays may be 
performed on intact cells expressing a chemosensoiy receptor, on pemeabilized cells, 

25 or on membrane fractions produced by standard methods or in vitro de novo 
synthesized proteins. 

In vivo, taste recq)tors bind tastants and initiate the transduction of chemical 
stimuli into electrical signals. An activated or inhibited G protein will in turn alter the 
properties of target enzymes, channels, and other effector proteins. Some exanq)les are 

30 the activation of cGMP phosphodiesterase by transducin m flie visual system, adenylate 
cyclase by the stimulatory G protem, phospholipase C by Gq and other cognate G 
proteins, and modulation of diverse channels by Gi and other G proteins. Downstream 
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consequences can also be examined such as generation of diacyl glycerol and P3 by 
phospholipase C, and in turn, for calcium mobilization by IPS. 

The TIR proteins or polypeptides of the assay will preferably be selected from a 
polypeptide having the TIR polypeptide sequence selected from those disclosed in 
5 Example 1, or fragments or conservatively modified variants tiiereof. Optionally, the 
fragments and variants can be antigenic fragments and variants which bind to an anti- 
TIR antibody. Optionally, the fragments and variants can bind to or are activated by 
sweeteners or umami tastants. 

Alternatively, the TIR proteins or polypeptides of the assay can be derived from 

10 a eukaryotic host cell and can include an amino acid subsequence having amino acid 
sequence identity to the TIR polypeptides disclosed in Example 1, or fragments, or 
conservatively modified variants thereof Generally, the amino acid sequence identity 
will be at least 35 to 50%, or optionally 75%, 85%, 90%, 95%, 96%, 97%, 98%, or 
99%. Optionally, the TIR proteins or polypeptides of the assays can comprise a 

15 domain of a TIR protein, such as an extracellular domain, transmembrane region, 
transmembrane domain, cytoplasmic domain, hgand-binding domain, and the like. 
Further, as described above, flie TIR protein or a domain thereof can be covalentiy 
linked to a heterologous protein to create a chimeric protein used in the assays 
described herein. 

20 Modulators of TIR receptor activity are tested using TIR proteins or 

polypeptides as described above, either recombinant or naturally occurring. The TIR 
proteins or polypeptides can be isolated, co-expressed in a cell, co-expressed in a 
membrane derived from a cell, co-expressed in tissue or in an animal, either 
recombinant or naturally occuning. For example, tongue shces, dissociated cells from 

25 a tongue, transformed cells, or membranes can be used. Modulation can be tested using 
one of the in vitro or in vivo assays described herein. 

For example, as disclosed in the experiment examples infra, it has been 
discovered that certain 5' nucleotides, e.g., 5' IMP or 5* GMP, enhance the activity of 
L-glutamate to activate the umami taste receptor, or block the activation of the umami 

30 taste recq)tor by umami taste stimuli such as L-glutamate and L-aspartate. 

1. //I vi/ra binding assays 
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Taste transchcrtion can dso be examined m vi<ra with soluble or solid State 
reactions, using the TIR polypeptides of the invention. In a particular embodiment, 
TIR ligand-binding domains can be used in vitro in soluble or solid state reactions to 
assay for ligand binding. 
5 For instance, the TIR N-tenninal domain is predicted to be involved in ligand 

binding. More particularly, the TlRs belong to a GPCR sub-family that is 
charaotaized by large, approximately 600 amino acid, extracellular N-tenminal 
segments. These N-terminal segments are thought to form the ligand-binding domains, 
and are therefore useful in biochemical assays to identify TIR agonists and antagonists. 
10 It is possible that the Ugand-binding domain may be formed by additional portions of 
the extracellular domain, such as the extracellular loops of the transmranbrane domain. 

In vitro binding assays have been used with other GPCRs that are related to the 
TlRs, such as the metabotropic glutamate receptors (see. e.g., Han and Hampson, J. 
Biol. Chem. 274:10008-10013(1999)). These assays might involve displacing a 
15 radioactively or fluorescently labeled ligand, measuring changes in intrinsic 
fluorescence or changes in proteolytic susceptibiiity, efc. 

Ligand binding to a hetero-multimeric complex of TIR polypeptides of flie 
invention can be tested in solution, in a bilayw membrane, optionally attached to a solid 
phase, in a lipid monolayer, or in vesicles. Binding of a modulator can be tested usmg, 
20 e.g., changes in spectroscopic characteristics (e.g., fluorescence, absorbence, refractive 
index) hydrodynamic i&g., shape), dhromatographic, or solubility properties. 
[0207] In another embodiment of the invention, a GTPy^^S assay may be used. As 
described above, upon activation of a GPCR, the Ga subunit of the G protein complex 
is stimulated to exchange bound GDP for GTP. ligand-mediated stimulation of G 
25 protein exchange activity can be measured in a biochemical assay measuring the 
bmding of added radioactively labeled GTPy'^S to the G protein in the presence of a 
putative ligand. Typically, membranes containing the chemosensory receptor of 
interest are mixed wifli a complex of G proteuis. Potential mhibitors and/or activators 
and GTPy'^S are added to the assay, and binding of GTlV^S to the G protein is 
30 measured. Binding can be measured by liquid scintillation counting or by any other 
means known in the art, including scintillation proximity assays (SPA). In o&er assays 
formats, fluorescently labeled GTPyS can be utilized. 
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2. Flaorescence Polarization Assays 
In anotiier embodiment. Fluorescence Polarization ('TP") based assays may be 
used to detect and monitor ligand binding. Fluorescence polarization is a versatile 
laboratory technique for measuring equilibrium binding, nucleic acid hybridization, and 
5 enzymatic activity. Fluorescence polarization assays are homogeneous in that they do 
not require a separation step such as centrifugation, filtration, chromatography, 
precipitation, or electrophoresis. These assays are done in real time, directly in solution 
and do not require an immobilized phase. Polarization values can be measured 
repeatedly and after the addition of reagents since measuring the polarization is rapid 
10 and does not destroy the sample. Generally, this technique can b^ used to measure 
polarization values of fluorophores fi-om low picomolar to micromolar levels. This 
section describes how fluorescence polarization can be used in a simple and 
quantitative way to measure the binding of ligands to the TIR polypeptides of the 
invention. 

15 When a fluorescently labeled molecule is excited with plane-polarized ligjht, it . 

emits light that has a degree of polarization that is inversely proportional to its 
molecular rotation. Large fluorescently labeled molecules remain relatively stationary 
during the excited state (4 nanoseconds in the case of fluorescein) and the polarization 
of the light remains relatively constant between excitation and emission. Small 

20 fluorescently labeled molecules rotate rapidly during the excited state and the 

polarization changes significantly between excitation and emission. Therefore, small 
molecules have low polarization values and large molecules have high polarization 
values. For example, a single-stranded fluorescein-labeled ohgonucleotide has a 
relatively low polarization value but when it is hybridized to a complementary strand, it 

25 has a higher polarization value. When using FP to detect and monitor tastant-binding 
which may activate or inhibit the chemosensory receptors of the invention, 
fluorescence-labeled tastants or auto-fluorescent tastants may be used. 
Fluorescence polarization (P) is defined as: 

^ Int^ -^Int^ 

30 Where n is the intensity of the enaission light parallel to the excitation light plane and 
Int lis the intensity ofthe emission Ught perpendicular to the excitation Ughtpla^^^ P, 
being a ratio of light intensities, is a dimensionless number. For example, the Beacon 
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® and Beacon 2000 ™ System may be used in connection with these assays. Such 
systems typically repress polarization in millipolarization units (1 Polarization Unit 
=1000 mP Units). 

The relationship between molecular rotation and size is described by the Perrin 
5 equation and the reader is referred to Jolley, M. E. (1991) in Journal of Analytical 
Toxicology, pp. 236-240, which gives a thorough explanation of this equation. 
Summarily, the Perrin equation states that polarization is directly proportional to the 
rotational relaxation time, the time that it takes a molecule to rotate through an angle of 
approximately 68.5° Rotational relaxation time is related to viscosity (r\% absolute 
1 0 temperature (T), molecular; volume (V), and the gas constant (R) by the following 
equation: 

3nV 

RotationalRelaxationTime =^ 

RT 

The rotational relaxation time is small (» 1 nanosecond) for small molecules 
(e g. fluorescein) and large (» 100 nanoseconds) for large molecules (e.g. 

1 5 immunoglobulins). If viscosity and temperature are held constant, rotational relaxation 
time, and therefore polarization^ is directly related to the molecular volume. Changes 
in molecular volume may be due to interactions with oflier molecules, dissociation, 
polymerization, degradation, hybridization, or conformational changes of the 
fluorescently labeled molecule. For example, fluorescence polarization has been used 

20 to measure enzymatic cleavage of large fluorescein labeled polymers by proteases, 
DNases, and RNases. It also has been used to measure eqxiilibrium binding for 
protein/protein interactions, antibody/antigen binding, and protein/DNA binding. 

A. Solid state and soluble hidi tliroughout assavs 
25 In yet another embodiment, the invention provides soluble assays using a 

hetero-oligomeric TIR polypeptide complex; or a cell or tissue co-expressing TIR 
polypeptides. Preferably, the cell will comprise a cell line that stably co-expresses a 
functional T1R1/T1R3 (umami) taste receptor or T1R2/T1R3 (sweet) taste receptor. In 
another embodiment, the invention provides solid phase based in vitro assays m a high 
30 tiurou^put format, where the TIR polypeptides, or cell or tissue expressing the TIR 
polypeptides is attached to a solid phase substrate or a taste stimulating compound and 
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contacted with a TIR receptor, and binding detected using an appropriate tag or 
antibody raised against the TIR receptor. 

In the high throughput assays of the invention, it is possible to screen up to 
several thousand different modulators or ligands in a single day. In particular, each 
5 well of a microtiter plate can be used to run a separate assay against a selected potential 
modulator, or, if concentration or incubation time effects are to be observed, every 5-10 
wells can test a single modulator. Thus, a single standard microtiter plate can assay 
about 100 (eg., 96) modulators. If 1536 well plates are used, then a single plate can 
easily assay from about 1000 to about 1500 different compounds. It is also possible to 
10 assay multiple compounds in each plate well. It is possible to assay several different 
plates per day; assay screens for up to about 6,000-20,000 different compounds is 
possible using the integrated systems of the invention. More recently^ microfluidic 
approaches to reagent manipulation have been developed. 

The molecule of interest can be bound to the solid state component, directly or * 
1 5 indirectly, via covalent or hon-covalent linkage, e.g., via a tag. The tag can be any of a 
variety of components. In general, a molecule which binds the tag (a tag binder) is 
fixed to a solid support, and the tagged molecule of interest (eg., the taste transduction 
molecule of interest) is attached to the solid support by interaction of the tag and the tag 
binder. 

20 A number of tags and tag binders can be used, based upon known molecular 

interactions well described in the literature. For example, where a tag has a natural 
binder, for example, biotin, protein A, or protein G, it can be used in conjunction with 
appropriate tag binders (avidin, streptavidin, neutravidin, the Fc region of an 
immunoglobulin, etc.). Antibodies to molecules with natural binders such as biotin are 

25 also widely available and appropriate tag binders (see^ SIGMA Immunochemicals 1998 
catalogue SIGMA, St. Louis MO). 

Similarly, any haptenic or antigenic compound can be used in combination witii 
an appropriate antibody to foim a tag/tag bmder pair. Thousands of specific antibodies 
are commercially available and many additional antibodies are described in the 

30 literature. For example, in one common configuration, the tag is a first antibody and 
the tag binder is a second antibody which recognizes tiie first antibody. In addition to 
antibody-antigen interactions, receptor-ligand interactions are also appropriate as tag 
and tag-binder pairs. For example, agonists and antagonists of cell membrane receptors 
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{e.g,, cell receptor-ligand interactions such as transferrin, c-kit» viral receptor ligands, 
cytokine receptors, chemokine receptors, interleukin receptors, immunoglobulin 
receptors and antibodies, the cadherein fanaily, the integrin family, the selectin family, 
and the Uke; see, e,g., Pigott & Power, The Adhesion Molecule Facts Book I (1993)). 
5 Similarly, toxins and venoms, viral epitopes, hormones (e.g,^ opiates, steroids, etc.), 
intracellular receptors (e,g., which mediate the effects of various small ligands, 
including steroids, thyroid hormone, retinoids and vitamin D; peptides), drugs, lectms, 
sugars, nucleic acids (both linear and cyclic polymer configurations), oUgosaccharides, 
proteins, phospholipids and antibodies can all interact with various cell receptois. 
1 0 Synthetic polymers, such as polyurethanes, polyesters, polycarbonates, 

polyureas, polyamides, polyethylenemiines, polyarylene sulfides, polysiloxanes, 
polyimides, and polyacetates can also form an appropriate tag or tag binder. Many 
other tag/tag binder pairs are also useful in assay systems described herein, as would be .\ 
apparent to one of skill upon review of this disclosure. 
15 Common linkers such as peptides, polyethers, and the like can also serve as : i 

tags, and include polypeptide sequences, such as poly gly sequences of between about 5 ' 
and 200 amino acids. Such flexible Unkers are known to persons ofskill in the art. For - 
example, poly(ethelyne glycol) Imkers are available from Shearwater Polymers, Inc. 
Huntsville, Alabama. These linkers optionally have amide linkages, sulfhydryl 
20 linkages, or heterofiinctional linkages. 

Tag binders are fixed to solid substrates using any of a variety of methods 
currently available. SoUd substrates are commonly derivatized or fimctionalized by 
exposing all or a portion of the substrate to a chemical reagent which fixes a chemical 
group to the surface which is reactive with a portion of the tag binder. For example, 
25 groups which are suitable for attachment to a longer chain portion would include 
amines, hydroxyl, thiol, and carboxyl groiqis. Aminoalkylsilanes and 
hydroxyalkylsilanes can be used to flmctionalize a variety of surfaces, such as glass 
surfaces. The constitutive of such solid phase biopolymer arrays is well described in 
the literature. See, e.g., Merrifield, J. Am. Chem. Soc, 85:2149-2154 (1963) 
30 (describing solid phase synthesis of, e.g., peptides); Geysen^M/., j: Immwt. Meth., 

102:259-274 (1987) (describing synthesis of solid phase components on pins); Frank & 
Doring, Tetrahedron, 44:60316040 (1988) (describing synthesis of various peptide 
sequences on cellulose dislcs); Fodor et aL, Science, 251:767-777 (1991); Sheldon et 



118 



wo 2005/015158 PCT/US2004/025459 

aL, Clinical Chemistry, 39(4):718-719 (1993); and Kozal et aL, Nature Medicine, 
2(7):753759 (1996) (all describing arrays of biopolymers fixed to solid substrates). 
Non-chemical ^proaches for fixing tag binders to substrates include other conunon 
methods, such as heat, cross-linkmg by UV radiation, and the like. 

5 

3. Cell-based assays 
la a preferred embodiment of treatment, a combination of TIR proteins or 
polypeptides are transiently or stably co-expressed in a eukaryotic cell either in 
unmodified forms or as chimeric, variant or truncated receptors with or preferably 

1 0 without a heterologous, chaperone sequence that facilitates its maturation and targeting 
through the secretory pathway. Such TIR polypeptides can be expressed in any 
eukaryotic cell, such as HEK-293 cells. Preferably, the cells comprise a fimctional G 
protein, e.g,, Gal5 or the chimeric G protein previously identified, or another G protein 
that is capable of coupling the chuneric receptor to an intracellular signaling pafliway 

15 or to a signaling protein such as phospholipase C. Also, preferably a cell will be 

produced that stably co-expresses TlRiyTlR3 or T1R2/T1R3 as such cells have been 
found (as shown in the experimental examples) to exhibit enhanced responses to taste 
stimuli (relation to cells that transiently express the same TIR combination). 
Activation of TIR receptors in such cells can be detected using any standard method, 

20 such as by detecting changes in intracellular calcium by detecting Fluo-4 dependent 
fluorescence in the cell. Such an assay is the basis of the experimental findings 
presented in this application. 

Activated GPCR receptors often are substrates for kinases that phosphorylate 
the C-terminal tail of the receptor (and possibly other sites as well). Thus, activators 

25 will promote the transfer of ^^P firom radiolabeled ATP to the receptor, which can be 
assayed with a scintillation counter. The phosphorylation of the C-tenninal tail will 
promote the binding of arrestin-like proteins and will interfere with the bindmg of G 
proteins. For a general review ofGPCR signal transduction and methods of assaying 
signal transduction, see, e.g.. Methods in Eftzymology, vols. 237 and 238 (1994) and 

30 volume 96 (1983); Bourne et al. Nature, 10:349:1 17-27 (1991); Boume et aL, Nature, 
348:125-32 (1990); Vitcheret al.,Annu. Rev. Biochem., 67:653-92 (1998). 

TIR modulation may be assayed by comparing the re^onse of TIR 
polypeptides treated with a putative TIR modulator to the response of an untreated 
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control sample or a sample containing a known '^positive" control. Such putative TIR 
modulators can include molecules that eitihier inhibit or activate TIR polypeptide 
activity. Jn one embodiment, control samples (untreated with activators or inhibitors) 
are assigned a relative TIR activity value of 100. Inhibition of a TIR polypeptide is 
5 achieved when the TIR activity value relative to the control is about 90%, optionally 
50%, optionally 25-0%. Activation of a TIR polypeptide is achieved when the TIR 
activity value relative to the control is 110%, optionally 150%, 200-500%, or 1000- 
2000%. 

Changes in ion flux may be assessed by deteimining changes in ionic 

10 polarization (te,, electrical potential) of the cell or membrane expressiug a TIR 

polypeptide. One means to determine changes in cellular polarization is by measuring 
changes in current (thereby measuring changes in polarization) with voltage-clamp and, 
patch-clamp techniques (see, e.g., the "cell-attached" mode, the "inside-out" mode, and 
the *Svhole cell" mode, e.g., Ackerman et aL, New Engl J Med., 336:1575-1595 

1 5 (1 997)). Whole cell currents are conveniently determined using Ihe standard. Other 
known assays include: radiolabeled ion flux assays and fluorescence assays using 
voltage-sensitive dyes {see, e.g., Vestergarrd-Bogind et al, J. Membrane Biol, 
88:67-75 (1988); Gonzales & Tsien, Chem. Biol, 4:269277 (1997); Daniel et al, J. 
Pharmacol Metk, 25:185-193 (1991); Holevinsky et al, J, Membrane Biology, 

20 137:59-70(1994)). 

The effects of the test compounds upon the function of the polypeptides can be 
measured by examining any of the parameters described above. Any suitable 
physiological change that affects GPCR activity can be used to assess the influence of a 
test compound on the polypeptides of this invention. When the functional 

25 consequences are determined using intact cells or animals, one can also measure a 
variety of effects such as transmitter release, hormone release, transcriptional changes 
to both known and uncharacterized genetic markers (e.g., northern blots), changes in 
cell metabolism such as cell growth or pH changes, and changes in intracellular second 
messengers such as Ca^^ IP3, cGMP, or cAMP. 

30 Preferred assays for GPCRs include cells that are loaded with ion or voltage 

sensitive dyes to report receptor activity. Assays for determining jactivity of such 
receptors can also use known agonists and antagonists for other G protem-coupled 
receptors as controls to assess activity of tested compounds. In assays for identifymg 
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modulatory compounds (e,g,, agonists, antagonists), changes in the level of ions in the 
cytoplasm or membrane voltage will be monitored using an ion sensitive or membrane 
voltage fluorescent indicator, respectively. Among tiie ion-sensitive indicators and 
voltage probes that may be employed are those disclosed in the Molecular Probes 1997 
5 Catalog. For G protein-coupled receptors, promiscuous G proteins such as Gal 5 and 
Gal6 can be used in the assay of choice (WiUde et aL, Proc. Nat 7 Acad. ScL^ 
88:10049-10053(1991)). 

Receptor activation initiates subsequent intracellular events, e.g,^ increases in 
second messengers. Activation of some G protein-coupled receptors stimulates the 

10 formation of inositol triphosphate (IP3) through phosphoUpase C-mediated hydrolysis 
of phosphatidylinositol (Benidge & Irvine, Nature, 312:315-21 (1984)), IPS in tum 
stimulates the release of intracellular calcium ion stores. Thus, a change in cytoplasmic 
calcium ion levels, or a change in second messenger levels such as IP3 can be used to 
assess G protein-coupled receptor function. Cells expressing such G protein-coupled 

1 5 receptors may exhibit increased cytoplasmic calcium levels as a result of contribution 
from both calcium release from intracellular stores and extracellular calcium entry via 
plasma membrane ion chaimels. 

In a preferred embodiment, TIR polypeptide activity is measured by stably or 
transientiy co-expressing TIR genes, preferably stably, in a heterologous cell with a 

20 promiscuous G protein that links the receptor to a phospholipase C signal transduction 
pathway (see Offermanns & Simon, J. Biol Chem., 270:15175-15180 (1995)). In a 
preferred embodiment, the cell line is HEK-293 (which does not normally express TIR 
genes) and the promiscuous G protein is Gal 5 (Offermanns & Simon, supra). 
Modulation of taste transduction is assayed by measuring changes in intracellular Ca^"*" 

25 levels, which change in response to modulation of the TIR signal transduction pathway 
via adnunistration of a molecule that associates with TIR polypeptides. Changes in 
Ca^*** levels are optionally measured using fluorescent Ca^"** indicator dyes and 
fluorometric imaging. 

In another embodunent, phosphatidyl inositol (PI) hydrolysis can be analyzed 

30 according to U.S. Patent 5,436, 128, herein incorporated by reference. Briefly, the 
assay involves labeling of cells with 3H-myoinositol for 48 or more hrs. The labeled 
cells are treated with a test compound for one hour. The treated cells are lysed and 
extracted in chlorofoim-methanol-water after which the inositol phosphates were 
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separated by ion exchange chromatography and quantified by scintillation counting. 
Fold stimulation is determined by calculating the ratio of q>m in the presmce of 
agonist, to cpm in the presence of buffer control Likewise, fold inhibition is 
determined by calculating the ratio of qpm in the presence of antagonist, to cpm in the 
5 presence of buffer control (which may or may not contain an agonist). 

Other receptor assays can involve determining the level of intracellular cyclic 
nucleotides, cAMP or cGMP. In cases where activation of the receptor results in a 
decrease in cyclic nucleotide levels, it may be preferable to expose the cells to agents 
that increase intracellular cyclic nucleotide levels, e.g., forskolin, prior to adding a 

10 receptor-activating compound to the cells in the assay. In one embodiment, the changes 
in intracellular cAMP or cGMP can be measured using immunoassays. The method 
described in OfiFermanns & Sunon, J! Bio, Chem., 270:15175-15180 (1995), maybe 
used to determine the level of cAMP. Also, the method described in Felley-Bosco et 
al^ Am. J. Resp. Cell and Mol BioL, 11:1 59-164 (1994), may be used to detennine the 

15 level of cGMP. Further, an assay kit for measuring cAMP and/or cGMP is described in 
U.S. Patent 4,115,538, herein incorporated by reference. 

In another embodiment, transcription levels can be measured to assess the 
effects of a test compound on signal transduction. A host cell containing TIR 
polypeptides of int^st is contacted with a test compound for a sufQcient time to effect 

20 any interactions, and then the level of gene expression is measured. The amount of 
time to effect such interactions may be empirically detemiined, such as by running a 
time course and measuring the level of transcription as a function of time. The amount 
of transcription may be measured by using any method known to those of skill in the art 
to be suitable. For example, mRNA expression of the protein of interest may be 

25 detected using northern blots or their polypeptide products may be identified using 
immunoassays. Altematively, transcription based assays using reporter gene may be 
used as described in U.S. Patent 5,436,128, herein incorporated by reference. The 
reporter genes can be, e.g., chloramphenicol acetyltransferase, luciferase, 
beta-galactosidase beta-lactamase and alkaline phosphatase. Furtiiermore, the protem 

30 of interest can be used as an indirect reporter via attachment to a second reporter such 
as green fluorescent protem (see^ eg., Mistili & Spector, Nature Biotechnology, 
15:961-964(1997)). 
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The amount of transcription is then compared to the amount of transcription in 
either the same cell in the absence of the test compound, or it may be compared with 
the amount of transcription m a substantially identical cell that lacks the TIR 
polypeptide(s) of interest. A substantially identical cell may be derived from the same 
5 cells from which the recombinant cell was prepared but which had not been modified 
by introduction of heterologous DNA. Any difference in the amount of transcription 
indicates that the test compound has in some maimer altered the activity of the TIR 
polypeptides of interest 

10 4. Transgenic non-human animals expressing chemosensory receptors 

Non-human animals expressing a combination of TIR taste receptor sequences 
ofthe invention can also be used for receptor assays. Such expression can be used to 
determine whether a test compound specifically binds to a mammalian taste 
transmembrane receptor complex in vivo by contacting a non-human animal stably or 

1 5 transiently transfected with nucleic acids encoding chemosensory receptors or ligand- 
binding regions thereof with a test compound and detemiining whether the animal 
reacts to the test compound by specifically binding to the receptor polypeptide 
complex. 

Animals transfected or infected with the vectors of the invention are particularly 
20 useful for assays to identify and characterize taste stimuli that can bind to a specific or 
sets of receptors. Such vector-infected animals expressing human taste receptor 
sequences can be used for in vivo screening of taste stimuli and their effect on, e.g., cell 
physiology (e.g., on taste neurons), on the CNS, or behavior. Alternatively, stable cell 
lines that express a TIR or combination thereof, can be used as nucleic transfer donors 
25 to produced cloned transgenic animals that stably express a particular TIR or 

combination. Methods of using nucleic transfer to produce cloned animals that express 
a desired heterologous DNA are the subject of several issued U.S. patents granted to the 
University of Massachusetts (licensed to Advanced Cell Technology, Inc.) and Roslin 
Institute (licmsed to Geron C!orp.). 
30 Means to infect/express the nucleic acids and vectors, either individually or as 

libraries, are well known m the art. A variety of individual cell, organ, or whole animal 
parameters can be measured by a variety of means. The TIR sequences of the 
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invention can be for example co-expressed in animal taste tissues by delivery with an 
infecting agent, e.g., adenovirus expression vector. 

The endogenous taste receptor genes can remain functional and wild-type 
(native) activity can still be present. In other situations, where it is desirable that all 
5 taste receptor activity is by the introduced exogenous hybrid receptor, use of a 
knockout line is preferred. Methods for the constitutive of non-human transgenic 
animals, particularly transgenic mice, and tiie selection and preparation of recombinant 
constructs for generating transformed cells are well known in the art. 

Constitutive of a "knockouf cell and animal is based on the premise that the 

10 level of expression of a particular gene in a mammalian cell can be decreased or 

completely abrogated by introducing into the genome a new DNA sequence that serves 
to interrupt some portion of the DNA sequence of the gene to be suppressed. Also, 
"gene trap insertion" can be used to disrupt a host gene, and mouse embryonic stem 
(ES) cells can be used to produce knockout transgenic animals (see, e.g., Holzschu, 

15 Transgenic Res 6:97-106 (1997)), The insertion ofthe exogenous is typically by 
homologous recombination between complementary nucleic acid sequences. The 
exogenous sequence is some portion of the target gene to be modified, such as exonic, 
intronic or transcriptional regulatory sequences, or any genomic sequence which is able 
to aflFect the level ofthe target gene's expression; or a combination thereof. Gene 

20 targeting via homologous recombination in pluripotential embryonic stem cells allows 
one to modify precisely the genomic sequence of interest Any technique can be used^ 
to create, screen for, propagate, a knockout aimnal, e.g., see Bijvoet, Hum. Mol Genet 
7:53-62 (1998); Moreadith, J. Mol Med. 75:208-216 (1997); Tojo, Cytotechnology 
19:161-165 (1995); UvAgda, Methods Mol Biol 48:167-184 (1995); Longo, 

25 Transgenic Res. 6:321-328 (1997); U.S. Patents Nos. 5,616,491; 5,464,764; 5,631,153; 
5,487,992; 5,627,059; 5,272,071; WO 91/09955; WO93/09222; WO 96/2941 1; WO 
95/31560; WO 91/12650. 

The nucleic acids ofthe invention can also be used as reagents to produce 
*Tcnockout" human cells and their progeny. Likewise, the nucleic acids ofthe invention 

30 can also be used as reagents to produce "knock-ins" in mice. The human or rat TIR 
gene sequences can replace the orthologous TIR in the mouse genome. In this way, a 
mouse expressing a human or rat TIR is produced. This mouse can then be used to 
analyze the function of human or rat TlRs, and to identify ligands for such TlRs. 
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a. Modulators 

The compounds tested as modulators of a TIR family member can be any small 
chemical compound, or a biological entity, such as a protein, nucleic acid or Upid. 
Examples thereof include 5* IMP and 5^ GMP. Essentially any chemical compound 
5 can be used as a potential modulator or hgand in the assays of the invention, although 
most often compounds that are soluble in aqueous solutions are tested Assays can be 
designed to screen large chemical libraries by automating the assay steps and providing 
compounds from any convenient source; these assays are typically run in parallel {e,g. ^ 
in microtiter formats on microtiter plates in robotic assays). It wiU be appreciated that 
10 chemical libraries can be synthesized by one of many chemical reactions (e.g. Senomyx 
proprietary chemistries). Additionally, there are many suppliers of chemical 
compoimds, including Sigma (St Louis, MO), Aldrich (St Louis, MO), Sigma-Aldrich 
(St. Louis, MO), Fluka Chemika-Biochemica Analytika (Buchs, Switzerland) and the 
like. 

IS In one preferred embodiment, high throughput screening methods involve 

providing a combmatorial chemical or peptide library containing a large number of 
potential taste affecting compounds (potential modulator or ligand compounds). Such 
"combinatorial chemical libraries" or "ligand libraries" are then screened in one or 
more assays, as described herein, to identify those Ubrary members (particular chemical 

20 species or subclasses) that display a desired characteristic activity. The compounds 
thus identified can serve as conventional "lead compounds" or can themselves be used 
as potential or actual taste modulators. 

Preferably, such Hbraries will be screened against cells or cell lines that stably 
express a TIR or combination of TlRs, i.e. T1R1/T1R3 or T1R2/T1R3 and preferably 

25 a suitable G protein, e.g. Gats- As shown in the examples infra, such stable cell lines 
exhibit very pronounced responses to taste stimuli, e.g. umami or sweet taste stimuli. 
However, cells and cell lines that transiently express one or more TlRs may also be 
used in such assays. 

A combinatorial chemical library is a collection of diverse chemical conq)ounds 
30 generated by either chemical synthesis or biological synthesis, by combining a nimiber 
of chemical **building blocks" such as reagents. For example, a linear combinatorial 
chemical library such as a polypeptide library is formed by combining a set of chemical 
building blocks (amino acids) in every possible way for a given compound length (i.e.. 
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fhe number of amino acids in a polypeptide compound). Thousands to millions of 
chemical compounds can be synthesized through such combinatorial mixing of 
chemical building blocks. 

Preparation and screening of combinatorial chemical libraries is well known to 
5 those of skill in the art. Such combinatorial chemical libraries include, but are not 
limited to, peptide libraries {see, e.g., U.S. Patent 5,010,175, Furka, Int J. Pept Prot 
Res., 37:487-493 (1991) and Houghton et al. Nature, 354:84-88 (1991)). Other 
chemistries for generating chemically diverse libraries can also be used. Such 
chemistries include, but are not limited to: peptoids {e.g., PCX Publication No. WO 

10 91/19735), encoded peptides {e.g., PCT Publication WO 93/20242), random 

bio-oligomers {e.g., PCT Publication No. WO 92/00091), benzodiazepines (e.g., U.S. 
Pat. No. 5,288,514), diyersomers such as hydantoins, benzodiazepines and dipeptides 
(Hobbs et al, Proc. Nat Acad. 5ci., 90:6909-6913 (1993)), vinylogous polypeptides 
CHagihara J. Amer. Chem, Soa, 114:6568 (1992)), nonpeptidal peptidomimetics 

15 with glucose scaffolding (Hirschmann ^t al, J. Amer. Chem. Soc, 1 14:9217-921 8 

(1992)), analogous organic syntheses of small compound libraries (Chen et al, J. Amer. 
Chem. Soc, 1 16:2661 (1994)), oligocarbamates (Cho et al. Science, 261:1303 (1993)), 
peptidyl phosphonates (Campbell et al, J. Org. Chem., 59:658 (1994)), nucleic acid 
libraries (Ausubel, Berger and Sambrook, all supra), peptide nucleic acid libraries (U.S. 

20 Patent 5,539,083), antibody libraries (V aughn et al, Nature Biotechnology, 

14(3):309-314 (1996) and PCT/US96/10287), carbohydrate libraries (Liang et al, 
Science, 274:1520-1522 (1996) and U.S. Patent 5,593,853), small organic molecule 
libraries (benzodiazepmes, Baum, C&EN, Jan 18, page 33 (1993); thiazolidinones and 
metathiazanones, U.S. Patent 5,549,974; pynrolidines, U.S. Patents 5,525,735 and 

25 5,519,134; moipholino compounds, U.S. Patent 5,506,337; benzodiazqpines, 5,288,514, 
and the like). 

Devices for the preparation of combinatorial libraries are commercially 
available {see, e.g., 357 MPS, 390 MPS (Advanced Chem Tech, Louisville KY), 
Sjmiphony (Rainin, Wobum, MA), 433 A (Applied Biosystems, Foster City, CA), 9050 
30 Plus (Millipore, Bedford, MA)). In addition, numerous combinatorial libraries are 
themselves commercially available {see, e.g., ComGenex, Princeton, NJ; Tripos, Inc., 
St. Louis, MO; 3D Pharmaceuticals, Exton, PA; Martek Biosciences; Columbia, MD; 
etc.). 
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In one aspect of the invention, the TIR modulators can be used in any food 
product, confectionery, pharmaceutical composition, or ingredient ttiereof to thereby 
modulate the taste of the product, composition, or ingredient in a desired itnanner. For 
instance, TIR modulators that enhance sweet taste sensation can be added to sweeten a 
5 product or composition; TIR modulators that enhance umami taste sensation can be 
added to foods to increase savory tastes. Altematively, TIR antagonists can be used to 
block sweet and/or umami taste. 

b. Kits 

10 TIR genes and their homologs are useful tools for identifying chemosensory 

receptor cells, for forensics and paternity determinations, and for examining taste 
transduction. TIR family member-specific reagents that specifically hybridize to TIR 
nucleic acids, such as TIR probes and primers, and TIR specific reagents that 
specifically bind to a TIR polypeptide, e.g,^ TIR antibodies are used to exanune taste 

1 S cell expression and taste transduction regulatioiL 

Nucleic acid assays for the presence of DNA and RNA for a TIR family 
member in a sample include numerous techniques are known to those skilled in the art, 
such as southern analysis, northem analysis, dot blots, RNase protection, SI analysis, 
ampUfication techniques such as PGR, and in situ hybridization. In in situ 

20 hybridization, for example, the target nucleic acid is Uberated fiom its cellular 
surroundings in such as to be available for hybridization within the cell while 
preserving the cellular moiphology for subsequent interpretation and analysis. The 
following articles provide an overview of the art of in situ hybridization: Singer et aL, 
Biotechniques, 4:230250 (1986); Haase et aL, Methods in Virology, vol. VH, pp. 

25 1 89-226 (1 984); and Nucleic Acid Hybridization: A Practical Approach (Names et aL, 
eds. 1987). In addition, a TIR polypeptide can be detected with the various 
inununoassay techniques described above. The test sample is typically compared to 
both a positive control (e.g., a sample expressing a recombinant TIR polypeptide) and a 
negative control 

30 The present invention also provides for Idts for screening for modulators of TIR 

family members. Such kits can be prepared fi'om readily available materials and 
reagents. For example, such kits can comprise any one or more of the following 
materials: TIR nucleic adds or proteins, reaction tubes, and instructions for testing 
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TIR activity. Optionally, the kit contains a biologically active TIR receptor or cell line 
that stably or transiently expresses a biologically active TIR containing taste receptor. 
A wide variety of kits and components can be prepared according to the present 
invention, depending upon the intended user of the kit and the particular needs of the 
5 user. 



EXAMPLES 

While the invention has been described in detail supra, the following examples 
are provided to illustrate preferred embodiments. These examples are intended to be 
10 illustrative and not limitative of the scope of the invention. 

In the protein sequences presented herein, the one-letter code X or Xaa refers to 
any of the twenty common amino acid residues, in the DNA sequences presented 
herein, the one letter codes N or n refers to any of the of the four common nucleotide 
bases,A,T,C,orG. 

15 

. Example 1 

Production of Intronless hXlR Expression Constructs 

Intronless hTlR expression constructs were cloned by a combination of cDNA- 
based and genomic DNA-based methods. To generate the full-length hTlRl 

20 expression construct, two 5' coding exons identified in a cloned hTlRl interval 
(accession # AL159177) were combined by PCR-overlap, and then joined to a 5'- 
truncated testis cDNA clone. The hTlR2 expression construct was generated from a 
partially sequenced hTlR2 genomic interval. Two missing hTlR2 5' exons were 
identified by screening shotgun libraries of the cloned genomic interval using probes 

25 derived firom the corresponding rat coding sequence. Coding exons were then 

combined by PCR-overlap to produce the full-length expression construct. The hTlR3 
expression construct was generated by PCR-overlap fix)m a sequenced hTlR3 genomic 
interval (accession # AL139287). Rat T1R3 was isolated fix>m a rat taste tissue-derived 
cDNA library using an rTlR3 exon fragment generated by hTlR3-based degenerate 

30 PCR. The partial hTlRl cDNA, rTlR2 cDNA, and partial hTlR2 genomic sequences 
were obtained jfrom Dr. Charles Zuker (University of California, San Diego). 
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The nucleic acid and amino acid sequences for fhe above-identified TIR cloned 
sequences as well as other full-length and partial TIR sequences are set forth in the 
sequence listing. 

Also, the following conceptual translations, which correspond to the C-termini 
5 of two orthologous pairs offish TlRs, are derived fi*om unpubUshed genomic sequence 

fragments and provided. Fugu TIRA was derived from accession 'scaffold 164'; Fugu 

TIRB was derived from accession LPC6171 1; Tetradon TIRA was derived from 

accession AL226735; Tetradon TIRB was derived from accession AL22238 1 . 

Ambiguities in the conceptual translations ('X*) result from ambiguities in database . 
1 0 sequences. These sequences can be found in tide sequence listing. 

Additionally, the accession number and reference citations relating to mouse 

and rat TlRs and allelic variants thereof in the public domain are is set forth below: 

rTlRl (Accession # AAD18069) (Hoon et al.. Cell 96 (4): 541-51 (1999)); 

rTlR2 (Accession # AAD18070) (Hoon et al., Cell 96(4): 541-59 (1999)); 
15 mTlRl (Accession # AAK39437); mTlR2 (Accession #AAK 39438); 

mTlR3 (Accession AAK 55537) (Max et al., Nat Genet. 28(1): 58-63 (2001)); 

rTlRl (Accession # AAK7092) (Li et al., Mamm. Genome (12(1): 13-16 (2001)); 

mTlRl (Accession # NP 1 14073); mTlRl (Accession # AAK07091) (Li et al., Mamm. 

Genome (121): 13-16 (2001)); rTlR2 (Accession # AAD18070) (Hoon et al., CeU 
20 9664): 541-551 (1999)); mTlR2 (Accession # NPH4079); mTlR3 (Accession # 

AAK39436); mTlR3 (Accession # BAB47181); (Kitagawa et al., Biochem. Biophys. 

Res. Comm. 283(l):236-42 (2001)); mTlR3 (Accession #NP1 14078); mTlR3 

(Accession # AAK55536) (Max et al, Nat. Genet 28(l):58-63 (2001)); and mTlR3 

(Accession No. AAK01937). 

25 

iSxample 2 

Sequence Alignment of Human and Rat TlRs 

Cloned TIR sequences selected from those identified above were aligned 
against the coiresponding rat TlRs. As shown in Figure 1, human TlRl, human T1R2 
30 and human T1R3 and rat T1R3 were aligned with previously described TlRs (rTlRl 
having Accession # AAD18069 and rTlR2 having Accession # AAD18070), the rat 
mGluRl metabotropic, glutamate receptor (Accession # P23385); and the human 
calcium-s^ing receptor (Accession #P41 1 80). For clarity of the comparison, the 
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mGluRl and calcium-sensing recqjtor C-tennini are truncated. The seven potential 
transmembrane segments are boxed in blue. Residues that contact the glutamate side- 
chain carbutylate in the mGluRl crystal structure are boxed in red, and residues that 
contact the glutamate a-amino acid moiety are boxed in green. The mGluRl and 
5 calcium-sensing receptor cysteine residues implicated in intersubunit disulfide-based 
formation are circled in purple. These cysteines are not conserved in TlRl and T1R2, 
but are located in a degraded region of the alignment that contains a potentially 
analogous T1R3 cysteine residue, also circled. 

10 Example 3 

Demonstration by RT-PCRthat hTlR2 and hTlR3 are expressed In taste tissue 
As shown in Figure 2, hTlR2 and hTlR3 are expressed in taste tissue: 
expression of both genes can be detected by RT-PCR from resected human 
circumvallate papillae. 

15 

Example 4 

Methods for Heterologous Expression of TlRs in Heterologous Cells 

An HEK-293 derivative (Chandrashekar et al., Cell 100(6): 703-1 1 (2000)), . 
which stably expresses Gal 5, was grown and maintained at 37°C in Dulbecco's 

20 Modified Eagle Medium (DMEM, Gibco BRL) supplemented with 10% FBS, MEM 
non-essential amino acids (Gibco BRL), and 3 fig/ml blasticidin. For calcium-imaging 
ejiperiments, cells were first seeded onto 24-well tissue-culture plates (approximately 
0.1 million cells per well), and transfected by lipofection with Minis TransIt-293 
(PanVera). To minimize glutamate-induced and glucose-induced desensitization, 

25 supplemented DMEM was replaced with low-glucose DMEM/GlutaMAX (Gibco 
BRL) approximately 24 hours after transfection. 24 hours later, cells were loaded with 
the calcium dye Fluo-4 (Molecular Probes), 3pM in Dulbecco's PBS bufifer (DPBS, 
GibcoBRL), for 1.5 hours at room temperature. After replacement with 250pJ DPBS, 
stimulation was performed at room temperature by addition of 200pl DPBS 

30 supplemented with taste stimuli. Calcium mobilization was monitored on a Axiovert 
SI 00 TV microscope (Zeiss) using Imaging Workbench 4.0 software (Axon). 
T1R1/T1R3 and T1R2/T1R3 responses were strikingly transient - calcium increases 
rarely persisted longer than 15 seconds - and asynchronous. The number of responding 
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cells was thus relatively constant over time; therefore, cell responses were quantitated 
by manually counting the number of responding cells at a fixed time point, t>pically 30 
seconds after stimulus addition. 

5 Example 5 

Human T1R2/T1R3 functions as a sweet taste receptor 

HEK cells stably expressing Gal 5 were transientiy transfected with human 
T1R2, T1R3 and T1R2/T1R3, and assayed for increases in intracellular calcium in 
response to increasing concentrations of sucrose (Figmre 3(a)). Also, T1R2/T1R3 dose 

10 responses were determined for several sweet taste stimuli (Figure 3(b)). The maximal 
percentage of responding cells was different for different sweeteners, ranging jfrom 10- 
30%. For clarity, dose responses were normalized to the maximal percentage of 
respondmg cells. The values in Figure 3 represent the mean ± s.e. of four independent 
responses. X-axis circles mark psychophysical detection thresholds determined by 

IS taste testing. Guimaiin (50-fold dilution of a filtered lOg/1 Gymnemd sylvestre aqueous 
extract) inhibited the response of T1R2/T1R3 to 250 mM sucrose, but not the response 
of endogenous P2-adrenergic receptor to 20 yM isoprotwenol (Figure 3(b)). Figure 
3(c) contains the normalized response of T1R2/T1R3 co-expressing cell lines to 
different sweeteners(sucrose, aspartame, D-tryptophan and saccharin). 
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Example 6 

Rat T1R2/T1R3 also functions as a sweet taste receptor 

HEK cells stably expressing Gal 5 were transiently transfected with 
5 hTlR2/hTlR3, rTlR2/rTlR3, hTlR2/rTlR3, and rTlR2/hTlR3, These transfected 
cells were then assayed for increased intracellular calcium in response to 350 mM 
sucrose, 25 mM tryptophan, 15 noGVI aspartame, and 0.05% of monellin. The results 
with sucrose and aspartame are contained in Figure 4 and indicate that rTlR2/rTlR3 
also functions as a sweet taste receptor. Also, these results suggest that T1R2 may 
10 control TlR2/TlR31igand specificity. 

Example? 

T1R2/T1R3 responses using an automated fluorescence based assay 

HEK cells stably e3q)ressing Gal5 were transiently transfected with hTlR2 and 
15 hTlR3. These cells were loaded with the calcium dye Fluo-4, and their responses to a 
sweetener measured using a fluorescence plate reader. Figure 5 contains cyclamate 
(12.5 mM) responses for cells expressing hTlR2/hTlR3 and for cells expressing only 
hTlR3 (J19-22). The fluorescence results obtained indicate that responses to these 
taste stimuli only occurred in the cells expressing hTlR2/hTlR3. Figure 6 contains 
20 nonnaUzed dose-response curves, the results of which show that hTlR2 and hTlR3 
function togetho* as a human taste receptor based on their dose-specific interaction witia 
various sweet stimuli . Particularly, Figure 6 contains dose-responses for sucrose, 
tryptophan and various other commercially available sweeteners. These results indicate 
that T1R2/T1R3 is a human sweet taste receptor as flie rank order and threshold values 
25 obtained in the assay closely mirror values for human sweet taste. 
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Example 8 

Ligand-binding residues of mGlaRl are conserved in TlRl 
As shown in Figure 6, the key ligand-bindmg residues of mGluRl are 
5 conserved in TlRl . The interaction of glutamate with mGluRl is shown with several 
key residues highlighted according to the same color scheme as Figure 1. 

Example 9 

Human T1R1/T1R3 functions as umami taste receptors 
1 0 HEK cells stably expressing Gal 5 were transiently transfected with human 

TlRl, T1R3 and T1R1/T1R3 and assayed for increases in intracellular calcium in 
response to increasing concentrations of glutamate (Figure 8(a)), and 0.5 mM 
glutamate), 02 mM IMP, and 0.5 mM glutamate plus 0.2 mM IMP (Figure 8(b)). 
Human T1R1/T1R3 dose responses were detennined for glutamate in the presence and 
1 5 absence of 0.2 mM IMP (Figure 8(c)). The maximal percentages of responding cells 
was approximately 5% for glutamate and approximately 10% for glutamate plus IMP. 
For clarity, does responses are normalized to the maximal percentage of responding 
cells. The values represent the mean + s.e. of four independent responses. X-axis 
circles mark taste detection thresholds determined by taste testing. 

20 

Example 10 
PDZIP as an Export Sequence 

The six residue PDZIP sequence (SVSTW (SEQ ID NO:l)) was fused to the C- 
tenninus of hTlR2 and the chimeric receptor (i.e. hTlR2-PDZIP) was transfected into 

25 an HEK-293 host cell. The surface expression of hTlR2 was then monitored using 
immunofluorescence and FACS scaiming data. As shown in Figures 9A and 9B, the 
inclusion of the PDZIP sequence increased the surface expression of hTlR2-PDZIP 
relative to hTlR2. More specifically, Figure 9A shows an immunofluorescence 
staining of myc-tagged hTlR2 demonstrating that PDZIP significantly increases the 

30 amount of hTlR2 protem on the plasma membrane. Figure 9B shows FACS analysis 
data demonstrating the same result- Cells expressing myc-tagged hTlR2 are indicated 
by flie dotted Une and cells expressing myc-tagged hTlR2-PDZIP are indicated by the 
solid luie. Particularly, Figure lOA shows untransfected Oal5 stable host cells in HBS 
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buffer, Figure lOB shows hTlR2-PDZIP transfected Gal5 stable hose cells in 
sweetener pool no. 5 (saccharin, sodium cyclamate, Acesulfame K, and Aspartame-20 
mM each in HBS buffer), Figure IOC shows T1R3-PDZIP transfected Gal5 stable host 
ceUs in sweetener pool no. 5, and Figure lOD shows hTlR2-PDZIP/hTlR3-PDZIP co- 
5 transfected Gal5 stable host cells in sweetener pool no. 5. Further, Figures lOE-lOH 
show dose-dependent response of hTlR2/hTlR3 co-transfected Gal5 stable host cells 
to sucrose-E: OmM in HBS buffer; F: 30 mM; G: 60 mM; and H: 250 mM. Figures 
lOI-lOL shown the responses of hTlR2/hTlR3 co-transfected Gal 5 stable host cells to 
individual sweeteners - 1: Aspartame (l.S mM); J: Acesulfame K (1 mM); K: Neotame 
10 (20mM); L: Sodium cyclamate (20mM). As demonstrated by the calcium-images of 
Figure 10, hTlR2 and hTlR3 are both required for flie activities triggered by the sweet 
stimuli. 

Example 11 

IS Generation of Cell Lines that Stably Co-Express T1R1/T1R3 or T1R2/T1R3 

Human cell lines that stably co-express human T1R2/T1R3 or human 
T1R1/T1R3 were generated by transfecting linearized PEAKlO-derived (Edge 
Biosystems) vectors and pCDNA 3.1/ZEO-derived (Invitrogen) vectors respectively 
containing hTlRl or hTlR2 expression construct (plasmid SAV2485 for TlRl, 

20 SAV2486 for T1R2) and hTlR3 (plasmid SXV550 for T1R3) into a Gais expressing 
cell line. Specifically, T1R2/T1R3 stable cell lines were produced by co-transfecting 
linearized SAV2486 and SXV550 mto Aurora Bioscience's HEK-293 cell line that 
stably expresses Gais. T1R1/T1R3 stable cell lines were produced by co-transfecting 
linearized SAV2485 and SXV550 mto the same HEK-293 cell line that stably 

25 expresses Gais. FoUowmg SAV2485/SXV550 and SAV2486/SXV550 transfections, 
puromycin-resistant and zeocin-resistant colonies were selected, expanded, and tested 
by calcium imaging for responses to sweet or umami taste stimuli* Cells were selected 
in 0.0005 mg/nd puromycin (CALBIOCHEM) and 0,1 mg/ml zeocin (Invitrogen) at 
ST^'C m low-glucose DMEM supplemented with GltitaMAX, 10% dialyzed FBS, and 

30 0.003 mg/ml blasticidin. Resistant colonies were expanded, and their responses to 

sweet taste stimuli evaluated by Fluorescence microscopy. For automated fluorimetric 
imaging on VIPR-II instrumentation (Aurora Biosciences), T1R2/T1R3 stable cells 
were first seeded onto 96-well plates (approximately 100,000 cells per well). Twenty- 
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four hours later, cells were loaded witii the calcium dye fluo-3-AM (Molecular Probes), 
0.005 mM in PBS, for one hour at room temperature. After replacement with 70 jil 
PBS, stimulation was performed at room temperature by addition of 70 ^1 PBS 
supplemented with taste stimuli. Fluorescence (480 nm excitation and 535 nm 
5 emission) responses from 20 to 30 seconds following compound addition were 

averaged, corrected for background fluorescence measured prior to compound addition, 
and normalized to the response to 0.001 mM ionomycin (CALBIOCHEM), a calcium 
ionophore. 

It was then observed that when these cell lines were exposed to sweet or umami 

1 0 stimuli, that for active clones typically 80-100% of cells responded to taste stimuli. 

Unexpectedly, the magnitude of individual cell responses was markedly larger than that 
of transiently transfected cells. 

Based on this observation, the inventors tested the activity of TIR stable cell 
lines by automated fluorescence imaging using Aurora Bioscience's VTPR 

15 instrumentation as described above. The responses of two T1R1/T1R3 and one 
T1R2/T1R3 cell line are shown m Figure 11 and Figure 12 respectively. 

Remarkably, the combination of increased numbers of responding cells and 
increased response magnitudes resulted in a greater than 10-fold increase in activity 
relative to transiently transfected cells. (By way of comparison, the percent ionomycin 

20 response for cells transiently transfected with T1R2/T1R3 was approximately 5% under 
optimal conditions.) Moreover, dose responses obtained for stably expressed human 
T1R2/T1R3 and T1R1/T1R3 correlated with human taste detection thresholds. The 
robust TIR activity of these stable cell lines suggests that they are well suited for use in 
high-throughput screening of chemical hbraries in order to identify compoimds, e.g. 

25 small molecules, that modulate the sweet or umami taste receptor and which therefore 
modulate, enhance, block or mimic sweet or umami taste. 

Example 12 

Generation of cell lines that indncibly co-express T1R1/T1R3 which selectively 
30 respond to umami taste stimuli 

T1R1/T1R3 HEK 293 cell lines that stably expressed the umami taste receptor 
display robust improved activity relative to transiently tranfected cites. However, a 
disaidvantage is that they can rapidly lose activity during cell propagation. 
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Also, these jSndings show that (i) T1R1/T1R3 is a mnami taste receptor, i.e., 
and (ii) that cell lines which robustly express T1R1/T1R3, preferably stable and/or 
inducible T1R1/T1R3 cell lines can be used in assays, preferably for high throughput 
screening of chemical libraries to identify novel modulators of umami taste. 
S Modulators that enhance umami taste may be used* 

To overcome the instability of the T1R1/T1R3 stable cell lines, the HEK-Gai5 
cells have been engineered to indiicibly express T1R1/T1R3 using the GeneSv^^ 
system (Invitrogen). pGene-derived zeocin-resistant e}q)ression vectors for human 
TlRl and T1R3 (plasmid SXV603 for TlRl and SXV61 1 for T1R3) and a puromycin- 

1 0 resistant pSwitch-derived vector that carries the GeneSwitch protein (plasmid SXV628) 
were linearized and cotransfected into 1heHEK-Gai5 cell line, Zeocin-resistant and 
puromycin-resistant colonies were selected, expanded, induced with variable amounts 
of mifepristone, and tested by calcium imaging for responses to umami taste stimuli. 
Inducible expression of T1R1/T1R3 resulted in robust activity. For example, 

15 approximately 80% of induced cells but only approximately 10% of transiently 

transfected cells responded to L-glutamate; More specifically, pGene derived Zeocin- 
resistant expression vectors that express human TlRl and human T1R3 and a 
puromycn-resistant pSwitch-derived vector that carries the GeneSwitch protein were 
linearized and co-tranfected into Gai5 cells. Cells were selected in 0.5 jig/ml 

20 puromycin (CAL BIOCHEM) and 100 fig/ml Zeocin (Invitrogen) at 37^C in 

Dulbecco*s Modified Eagle Medium supplemented with GlutaMAX, (10 % dialyzed 
FBS, and 3 ug^ml blasticidin. Resistant colonies were expanded, and their responses to 
umami taste stimuli following induction with 10"*^M mifepristone determined by 
fluorescence microscopy following the methods of Li et al., PNAS 99(7): 4692-4696 

25 (2002). 

For automated fluorometric imaging on FLPR instrumentation (Molecular 
Device), cells from one clone (designated clone M7) were seeded into 96-well plates 
(approximately 80,000 cell per well) in the presence of 10"^° M mifepristone and 
incubated for 48 hours. Cells were then loaded with the calcium dye fluo-4-AM 
30 (Molecular Probes), 3 jiM in PBS, for 1 .5 hours at room temperature. 

After replacement with 50 jxl PBS, stimulation was performed at room 
temperature by the addition of 50 ^1 PBS supplemented with different stimuli, la 
contrast to previous transient T1R1/T1R3 umami receptor expression systems that 
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necessitated quantifying T1R1/T1R3 receptor activity by individually counting 
responding cells (Li et al., PNAS 99(7): 4692-4696 (2002)) (because of the low activity 
of the receptor therein), the subject inducible expression system resulted in a clone 1-17 
having substantially increased activity that allowed receptor activity to be quantified by 
5 determining maximal fluorescence increases (480 nm excitation and 535 nm emission) 
summated over fields of imaged cells. The maximal fluorescence firom four 
independent detenninations was averaged, corrected for background fluorescence 
measured prior to compound addition, and normalized to the response to 0.002 mM 
ionomycin (CALBIOCHEM). 

10 These results are contained in Figure 13. Particularly, Figure 13 contains a 

dose-response curve determined for L-glutamate in the presence and absence of 0.2 
mM IMP. In the figure, each value represents average sunomated maximal fluorescence 
(corrected for backgroimd fluorescence) for four independent detenninations. These 
dose-response curves correspond to those determined for cells transiently transfected 

15 withTlRl/TlR3. 

The selectivity of the umami T1R1/T1R3 taste receptor was also evaluated by 
screening with different L-amino acids. The results obtained indicated that T1R1/T1R3 
is selectively activated by the umami-tasting L-amino acids (L-glutamate and L- ^ 
aspartate). 

20 The results of experiments wherein the responses of the 1-17 clone was resulted 

in tested in the presence of different L-amino acids are contained in Figure 14 and 
Figure 15, Figure 14 shows the results of an experiment wherein the 1-17 cell line was 
contacted with different L-amino acids at a concentration of lOmM in the presence and 
absence of ImM IMP. 

25 Figure 15 contains a dose-response curve for active amino acids determined in 

the presence of 02wM IMP. Each value represents the average of four independent 
determinations. 

The results obtained in these experiments support the specificity and selectivity 
of the umami taste receptor to umami taste stimuli. Whereas the umami taste stimuli L- 
30 glutamate and L-aspartate significantly activated the T1R1/T1R3 receptor at different 
concentrations (see Figure 14 and 15), the other L-amino acids which activated the 
human T1R1/T1R3 receptor only activated the receptor weakly and at much higher 
concentrations. 
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Therefore, these results support the selectivity of the T1R1/T1R3 receptor for 
uinami taste stimuli and the suitability of tiiis inducible stable expression system for use 
in high throughput screening assays using automated fluorometric imaging 
instrumentation to identify compounds that activate tiie umami taste receptor, for 
5 example L-glutamate or L-aspartate , or which enhance the activity of L-glutamate to 
activate the umami taste receptor, for example 5'-IMP or 5'-GMP, or block the 
activation of the umami taste receptor by umami taste stimuli such as L-glutamate and 
L-aspartate. 

Compounds identified using these assays have potential application as 
10 flavorants in foods and beverage compositions for mimicing or blocking umami taste 
stimuli. 

Example 13 

Lactisole Inhibits the Receptor Activities of Human T1R2/T1R3 and T1R1/T1R3 , 

IS and Sweet and limami Taste 

Lactisole, an aralkyl carboxylic acid, was thought to be a selective sweet-taste 
inhibitor (See e.g., Lindley (1986) U.S. Patent 4,567,053; and Schif&nan et al. Chem 
Senses 24:439-447 (1999)). Responses of HEK-Gais cells transiently transfected with 
T1R2/T1R3 to 150 mM sucrose in the presence of variable concentrations of lactisole 

20 were measured. Lactisole inhibits the activity of human T1R2/T1R3 with an IC50 of 24 

The T1R1/T1R3 umami and T1R2/T1R3 sweet taste receptor may share a 
common subunit. It has therefore been theorized that lactisole, which inhibit the 
T1R2/T1R3 sweet taste receptor, may have a similar effect on the T1R1/T1R3 umami 

25 taste receptor. The present inventors tested the effect of lactisole on the response of 
human T1R1/T1R3 to lOmM L-Glutamate. As with the TlR2/nR3 sweet receptor, 
lactisole inhibited T1R1/T1R3 with an IC50 of 165 pM. Lactisole inhibition likely 
reflects antagonism at the TIR receptors instead o^ for example, non-specific 
inhibition of Gais-mediated signaling because the response of muscarinic acetylcholine 

30 receptors was not inhibited by lactisole. 

The present inventors then evaluated die effect of lactisole on human umami 
taste. Taste thresholds m the presence of 1 and 2 mM lactisole were determined for ttie 
umami taste stimuli L-Glutamate with or without 0.2 mM IMP, the sweet taste stimuli 
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sucrose and D-tryptophan, and the salty taste stimulus sodium chloride following the 
methods of Schifi&nan et al. (Chem. Senses 24: 439-447 (1989)). Millimolar 
concentrations of lactisole dramatically increased detection tiiresholds for sweet and 
umami but not salt taste stimuli. These results are contained in Figure 16. 
5 In conclusion, (i) these findings fiirflier support the inventors* hypothesis that 

T1R1/T1R3 is the only umami taste receptor, and (ii) the T1R1/T1R3 and T1R2/T1R3 
receptors may share a structurally related lactisole-binding domain. 

While the foregoing detailed description has described several embodiments of 
the present invention, it is to be understood that the above description is illustrative 
10 only and not limiting of the disclosed invention. The invention is to be limited only by 
the claims which follow. 

Example 14 

Mapping of Ligand Interaction Sites on tlie Sweet Receptor 

15 

Through coexpression of T1R2R-H with human T1R3, part of the human sweet 
receptor (the N-termmal domain of T1R2) was replaced with vat protein sequence. The 
responses to aspartame and neotame are abolished, showing that the N- terminal 
domain of human T1R2 is required for recognizing aspartame and neotame. Similarly, 

20 the rat T1R2 N-tenninal domain was also replaced with human protein sequence by 
co^ressmg T1R2H-R with rat T1R3. The chimeric receptor gains the ability to 
respond to aspartame and neotame, suggesting that the same domain of human T1R2 is 
also sufficient (in the context of sweet receptors) to recognize those two sweeteners 
(Fig. 22B). These in vitro functional expression data indicate that the important 

25 mteraction determinants are located in the N-terminal extracellular domain. 

In contrast, replacing either half of human T1R2 with rat protein sequence 
does not affect its response to cyclamate. Instead, the C-terminal domain of human 
T1R3 is required and sufficient, when co-expressed with T1R2, to recognize cyclamate 
(Fig. 22C). The transmembrane domain of family C GPCRs has been known to 

30 contain binding sites for allosteric modulators (Gasparini, F„ R. Kuhn, and J.P. Pin, 
Curr Opin Pharmacol 2002 Feb;2(l):43-9). This is flie first case in family C GPCSR, 
where an agoiiist binds directly to the transmembrane domain and activates the 
receptor in the absence of oflier ligand. . 
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Lactisole, an aralkyl carbdyxlic acid, is a specific human sweet taste 
inhibitor, which has physiological effect on the rodent taste. Consistent with the taste 
effect, lactisole inhibits the human but not rat T1R2/T1R3 response to sucrose in our 
assay system (Fig. 22A). The same kind of mapping experiments on lactisol 
5 interaction site using the TIR chimeras was performed. Like cyclamate, lactisole 
requires flie human T1R3 C-terminal domain to inhibit the receptor's response to 
sucrose and acesulfame K 

(Fig. 22D). This result further demonstrates the importance of T1R3 C-terminal 
domain in the sweet receptor function. The chimeras in all 16 possible combinations 
10 were tested, and lall functional combinations generated results consistent with our 
model. 

Mutagenesis studies were conducted on both T1R2 and T1R3 to narrow down 
the essential amino acidis in recognition of aspartame, neotame, and cyclamate. If 
T1R2 and T1R3 are responsible for recognizing dlifferent sweeteners, mutations in 

1 S T1R2 N-terminal domain would affect responses to aspartame and neotame, but not 
cyclamate. hi addition, mutations in T1R3 C-terminal domain would have the opposite 
effect. To select the crucial amino acid residues in the T1R2 N-terminal domain, the 
sequence of T1R2 was aligned with mGluRl (Fig. 23A). Among the eight residues 
that are crucial in Ugand binding in mGluRl (Kunishima, N., et al., Nature, 2000. 

20 407(6807): p. 971-7), three are conserved in human T1R2 (S144, Y218, and E302). 
Each of the three residues were mutated and the resulting receptors were tested for 
their response to different sweeteners. Substitution of Y21 8 to A aboHshed the 
responses to all sweeteners tested, showing Y218 is important for the overall 
conformation of the receptor. The two other hTlR2 variants, containing S144A and 

25 E302A, selectively affected the response to aspartame and neotame but not cyclamate. 
Stable cell lines expressing S144A and E302A hTlR2 variants (coej^ressed with wild 
type hTlR3 and Gais) did not respond to aspartame or neotame at the physiological 
concentrations, but did respond to cyclamate (Fig. 23B). 

In order to further map the cyclamate-binding site, the fliree extracellular loops 

30 in the T1R3 C-terminal domain were focused on. AUgmnent of human and rodent 
TlR3s reveal multiple amino add differences in the three extracellular loops (Fig. 
23Q. Replacmg extracellular loop-2 or loop-3 with rat sequences abolished flie 
cyclamate response without affecting the sucrose or aspartame responses, hi contrast. 
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replacing extracellular loop 1 had no obvious effect on response to cyclamate, showing 
an important role for EC loops 2 and 3 in recognizing cyclamate (Fig. 23D). None of 
those loop-replacements affected the inhibition effect of lactisole, showing a different 
binding mechanism. In summary, amino acid substitutions in T1R2 or T1R3 result in 
5 selective interference of activities induced by different sweeteners, consistent with the 
chimeric receptor results. 

The above results demonstrate that the human sweet receptor function as a 
heteromeric cotqpl^ of T1R2 and T1R3. Both subunits are required for recognizmg 
different sweeteners, and the data indicate the existence of multiple binding pockets 
10 on the receptor for different classes of agonists. The presence of multiple ligand- 
binding sites provides structural guidance and definition for the specifically binding 
compounds of the invention. 

Example 15 
Mapping of Receptor-G protein Interactions 

1 5 The human and rat sweet receptors are also different in flieir G protein-coupling 

efficiency. Even though both hunian and rat receptors can couple efficiently to Gaisnu 
only the human receptor can couple efficiently to Gais (Fig. 24A). This species 
difference allows for mappiug of the receptor G protein interactions using the same 
chimeric receptors as described above. T1R2 but not T1R3 appears to be critical for 

20 Gai5-coi5)ling, since replacing the C-terminus of human T1R2 with the corresponding 
rat sequence abolished coupling, and replacing rat T1R2 C-terminal half with human 
sequence enabled the receptor to couple to Gais and respond to sucrose and acesulfame 
K (Fig. 24); Swapping the T1R3 C-terminal sequences had no ejBTect on Gais-coxipling 
(Fig. 24B). This observation demonstrates the important role of T1R2 in G protein- 

25 coupluig in the functional expression system. Gustdudn (Wong, G.T., K.S. Gannon, 
and R.F. Margolskee, Nature, 1996. 381(6585): p. 796-800) has been proposed to be an 
endogenous G protein for the sweet taste receptor, and T1R2 can be the subunit 
responsible for in viyo coupling in taste cells. GABAbR is the other example of 
heteromeric femily C GPCR, whereas one subunit (GABAbRI) is responsible for 

30 ligand- bmding, and the other (GABAbR2) for G protein coupling (Margeta-Mitrovic, 
M., Faroe Natl Acad Sci U S A, 2001, 98(25): p. 14643-8; Margeta-Mitrovic, M., Proc 
Natl Acad Sci U S A, 2001. 98(25): p. 14649-54). The sweet receptor is different fi'om 
GABAbR in that T1R2 is required for both ligand recognition and G- protein coupling. 
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Example 16 

Lactisole Antagonizes Human T1R1/T1R3 and Inhibits Human Umami 

Taste 

5 It was hypothesized that since T1R1/T1R3 function as heteromeric receptors 

as well as the sweet receptor, that lactisole should have similar effect on T1R1/T1R3 
activity, since T1R3 is a common subunit between the sweet and the umami receptors. 
Indeed, lactisole antagonized human T1R1/T1R3 (Fig. 25A). Lactisole acts as a 
noncompetitive inhibitor of T1R1/T1R3, since the ICso values are ^parently not 

10 dependent on glutamate concentration (Fig. 25B), and lactisole reduces the maYimfll 
activities of the recq)tor without significantly changing the ECso of agonists (Fig, 
25Q. These results demonstrate that lactisole binds to a different site from L- 
glutamate, and are consistent with the hypothesiis that the glutamate-binding pocket is 
located in TlRl . Lactisole appears to be a competitive inhibitor of the sweet receptor, 

15 as its ICsos are dependent on the concentrations of the sweeteners, and it increases the 
ECsos of the sweeteners without significantly affecting the maximal activities. 

The inhibition effect of lactisole is mediated by the TIR receptors since it had 
no effect on the endogenous muscarinic acetylcholine receptor in HEK cells or on a 
mouse bitter receptor, mT2R5, transiently expressed in HEK cells. As was the case 

20 for the T1R2/T1R3 receptor, lactisole inhibition of the T1R1/T1R3 response to 
umami taste stimuli was reversible following washout and restimulation. 

To correlate the receptor activity with behavior, the effect of lactisole on 
human xmiami taste was tested. As predicted, millimolar concentrations of lactisole 
dramatically increased detection thresholds for sweet and umami but not salt taste 

25 stimuli (Fig. 25D). Lactisole was previously not known to be an umami taste mhibitor. 
The correlation between receptor activity and taste results demonstrates a crucial role 
of TlRs in human umami taste. 

Example 17 

Cyclamate Enhances Human T1R1/T1R3 Receptor Activities 

30 Based on the same heteromeric model of TlRs (Fig. 26), it was predicted that 

cyclamate would also modulate the activity of the human T1R1/T1R3 umami receptor 
by acting on T1R3 . Although cyclamate alone had no effect on T1R1/T1R3, it 
enhanced tiie activity of the receptor in the presence of L-glutamate (Fig. 27E), This 
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effect is specific for fhc human T1R1/T1R3, as cyclainate had no effect on the 
activities of the endogenous muscarinic acetylcholine receptor in the presence of 
carbachol (Fig. 27E). It is noteworthy thiat cyclamate has comparable ECsos for the 
sweet receptor (Fig. 23B) and umami receptor. Cyclamate reproducibly left-shifts the 
5 dose-response curves for L-glutamate by 

^2 fold in the presence or absence of IMP (Fig. 25F). IMP has a more dramatic 
effect of enhancing the receptor, and the effect of cyclamate is observed in the 
presence of IMP (Fig. 25F), suggesting a different mechanism from IMP in 
enhancing the recq)tor. IMP appears to bind to TlRl, since it has no effect on the 

10 sweet receptor. Other sweeteners, includmg sucrose, aspartame, saccharin, and D- 
tryptophan, had no effect on the human TIRIATIRS activities. 

In summary, it has been demonstrated that both T1R2 and T1R3 are required 
in a fimctional sweet receptor, that aspartame and neotame reijuire the N-terminal 
extracellular domain of T1R2, G protein-coupling requires C-terminal half of T1R2, 

15 and that cyclamate and lactisple require the transmembrane domain of T1R3. These 
findings demonstrate the different functional roles of TIR subunits in a heteromeric 
complex and the presence of multiple sweetener interaction sites on the sweet 
receptor. Because T1R3 is the common subxmit in the sweet and the umami receptors, 
it was predicted and confirmed the effect of cyclamate and lactisole on the umami 

20 receptor. Furthermore, a correlation was able to be made between the lactisole effect 
on the receptor activities with taste. Based on these observations, a model was created 
(Fig, 26) for the structure-fimction relationships of the TIR family taste receptors. 
Natural carbohydrate sweeteners bind to the N-terminal domain of T1R2, similar to 
aspartame and neotame, and there are other hgand binding sites on the sweet receptor 

25 as well, for example, the transmembrane domain of T1R2. The vamam xecGptor 

functions similarly as a heteromeric conq)lex, and MSG and IMP each ^peacs to bind 
to the TlRl subunit, since neither has any effect on the sweet i^eptor, and the 
transmembrane domain of TlRl is responsible for coupling to G proteins. 
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Example 18 

HTS Protocol for Sweet Tastants 
An HEK293 cell line derivative (Chandrashekar, I, kueller, KX., Hoon, MA., 
Adler, B., Feng, L., Guo, W., Zuker, C.S., Ryba, N J.,. CelJ 2000, 100, 703-71 1.) that 
5 stably expresses Gal5 and hTlR2/hTlR3 (Li, X., Staszewski, L., Xu, H., Durick, KL, 
ZoUer, M., Adler, K Proc Natl Acad Sci USA 2002, 99, 4692-4696, World Patent # 
WO 03/001876 A2, herein incorporated by reference in their entirety) was used to 
identify compounds with sweet taste enhancing properties. 

Compounds were initially selected based on their activity on the hTlR2/hTlR3- 

10 HEK293-Gal5 cell line (Li, et al. vide supra). Activity was determined using an 

automated fluorometric imaging assay on a FLIPR instrument (Fluorometric Intensity 
Plate Reader, Molecular Devices, Sunnyvale, CA) (designated FLIPR assay). Cells 
from one clone (designated S-9 cells) were seeded into 384-well plates (at 
approximately 50,000 cells per well) in a medium containing DMEM Low Glucose 

15 (Invitrogen, Carlsbad, CA), 10% dialyzed fetal bovine senmi (Invitrogen, Carlsbad 
CA), 100 Units/ml Penicillin G, and 100 |ig/ml Streptomycin (Invitrogen, Carlsbad, 
CA) (li, et al. vide supra) see also World Patent #W0 03/001876 A2). S-9 cells were 
grown for 24 hours at 37 °C. S-9 cells were then loaded with the calcium dye Fluo- 
3AM (Molecular Probes, Eugene, OR), 4 pM in a phosphate buffered saline (D-PBS) 

20 (Invitrogen, Carlsbad, CA), for 1 hour at room temperature. After replacement with 25 
pi D-PBS, stimulation was performed in the FLIPR instrument and at room temperature 
by the addition of 25 jil D-PBS supplemented with different stimuli at concmtrations 
corresponding to twice the desired final level. Receptor activity was quantified by 
detemiining the maximal fluorescence increases (using a 480 nm excitation and 535 nm 

25 emission) after normaUzation to basal fluorescence intensity measured before 
stimulation. 

For dose-respohses analysis, stimuh were presented in duplicates at 10 different 
concentrations ranging from 60 nM to 30 |iM. Activities were normalized to the 
response obtained with 400 mM D-fiixctose, a concentration that elicits maximum 
30 receptor response. EC50S were determined using a non-linear regression algorithm 
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(using Seuomyx, Inc. software), where the HUl slope, bottom asymptotes and top 
asymptotes were allow to vary. Identical results were obtained when analyzing the 
dose-response data using commercially available software for non-linear regression 
analysis such as GraphPad PRISM (San Diego, CA). 

5 In order to determine the dependency of hTlR2/hTlR3 for the cell response to 

different stimuli, selected compounds were subjected to a similar analysis on HEK293- 
Gal5 cells (not expressmg the human sweet receptor). The HEK293-Gal5 cells do not 
show any functional response in the FLIPR assay to D-Fructose or any other known 
sweeteners. Similarly, compounds described herein do not induce any functional 
10 response when using HEK293-Gal5 cells in the FLIPR assay. 

Example 19 

Flavor Enhancement Measurements for Sweet Tastants using Human Volunteers 

Basic screening of sensory taste testers: Potential panelists were tested for their 
abilities to rank and rate intensities of solutions representing the five basic tastes. 

15 Panelists ranked and rated intensity of five different concentrations of each of the five 
followmg compounds: sucrose (sweet), sodium chloride (salty), citric acid (sour), 
caffeine (bitter), and monosodium glutamate (umami). Panelists tasted a total of 25 
samples per session (5 samples of each of the 5 solution types). In the first session, 
panelists ranked the five concentrations for intensity of the attribute in question. This 

20 was repeated four more times with other samples, in the second session, panelists rated 
intensity of the five concentrations of each sample using a line scale called the 
"Labeled Magnitude Scale" (LMS). The IMS is anchored with intensities (e.g. barely 
detectable, weak, moderate, strong, very strong, and strongest imaginable) to assist 
panelists in rating the samples. Samples were tasted in IQml volumes at room 

25 temperature and labeled with 3-digit blinding codes. Samples were presented in 
randomized, counterbalanced order within each sample solution (e.g. sucrose, citric 
acid, etc.). 

In order to be selected for participation in testing, panelists needed to correctly 
rank and rate samples for intensity, with a reasonable number of errors. Approximately 
30 25 people successfully completed this procedure. 

Panelists selected in tiie above procedure were deemed qualified for performing 
Preliminary Taste Testing procedures. The preliminary taste tests are used to evaluate 
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new compoimds for intensity of basic tastes and oflF-tastes. A small group of panelists 
(n=5) taste approximately 5 concentrations of the compound (range typically between 
1-lOOuM, in half-log cycles, e.g. 1, 3, 10, 30, and lOOuM) in water or buffer and in a 
solution of 4% (wA^, 1 17 wM) sucrose to evaluate enhancement. Typically samples 
5 also contain 0. 1% ethanol in order to aid dispersion of the compound in a water-based 
solution. Panelists rate the five basic tastes (sweet, salty, sour, bitter, and umami) as 
well as ofif-tastes (such as chemical, metallic, sulfur) on the LMS. Samples are served 
in 10ml portions at room temperature. The purpose of the test is to determine the 
highest concentration at which there is no objectionable ofp-taste, and detennine if 
10 obvious enhancement of sweet taste exists at any of the concentrations tested. 

If the compound is effective and does not have objectionable off-tastes, it is 
tested with a trained (expert panel) in a larger study. 

For example; Five panelists evaluated 1, 3, 10, 30, and lOOuM XVI-3 in water 
and in 4% sucrose solution. All samples with compound were balanced for ethanol at 
15 0.1% (aids in dispersion of compound). Panelists were asked to rate basic tastes and 
off-tastes using the LMS for each sample tasted. When panelists noted sweetness in 
any sample, they were asked to taste reference samples of sua:ose (2, 4, 6, 8% sucrose) 
to estimate equivalent sweetness. 

A trained (expert) panel was used to further evaluate compounds that had been 
20 tested with the preliminary taste test. 

Panelists for the tramed panel were selected fi-om the larger group of qualifying 
taste panelists. Panelists were furtlier trained on sweet taste by ranking and rating 
experiments using sucrose solutions. Panelists completed a series of ranking, rating, 
and difference from reference tests with sweet solutions. In rankmg and rating 
25 experiments, panelists evaluated sucrose concentrations (2, 4, 6, 8 % (w/v)) sucrose. 

CompouQds tested by the trained panel were evaluated in difference &om 
reference experiments. Panelists were given reference samples of various 
concentrations (2,4,6, or 8 % (w/v) sucrose) and asked to rate samples on a scale of -5 
to +5 in terms of difference in sweet taste firom the reference (score: -5= much less 
30 sweet taste than the reference; 0=same sweet taste as flie reference; +5=much more 
sweet taste than the reference). Test samples were solutions with varying amounts of 
sucrose and compound. Typically, each session conq)ared the reference sample 
(labeled as REF) to numerous test samples (labeled with 3-digit blinding codes). Tests 
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typically included various samples with varying concentrations of sucrose, as well as 
one blind sample of the reference itself, to evaluate panel accuracy. Compoxmds were 
tested against the reference in samples with and without 4% or 6% sucrose. All 
samples were presented in 10ml volumes at room temperature. Futhermore, to 
5 determine the sweetness of the compound alone, a reference solution was prepared at 
HiG designated concentration and compared to the threshold sweetness of sucrose (2%). 

Example 20 
HTS Protocol for Umami Tastante 
HEK-Gai5 cells were engineered to inducibly express T1R1/T1R3 using the 
10 GmeSwitch system (Ihvitrogen). pGene-derived zeocin-resistant expression vectors 
for human TlRl and T1R3 (plasmid SXV603 for TlRl and SXV61 1 for T1R3) and a 
puromycin-resistant pSwitch-derived vector that carries the GfeneSwitch protein 
(plasmid SXV628) were linearized and cotransfected into the HEK-Gais cell line. 
Zeocin-resistant and puromycin-resistant colonies were selected, expanded, induced 
15 with variable amounts of mifepristone, and tested by calcium imaging for responses to 
umami taste stimuli. Cells were selected in 0.5 |Lig/ml puromycin (CAL BIOCHEM) 
and 100 jig/ml Zeocin (Invitrogen) at 37°C in Dulbecco's Modified Eagle Medium 
supplemented with GlutaMAX, (10 % dialyzed FBS, and 3 ug/ml blasticidin. Resistant 
colonies were expanded, and their responses to umami taste stimuli following induction 
20 with 1 0"^^ M mifepristone detennined by fluorescence microscopy following flie 
methods of li, et aL, PNAS (2002) 99(7):4692-4696, For automated fluorometric 
imaging on FLIPR instrumentation (Molecular Device), cells from one clone 
(designated clone 1-17) were seeded into 96- or 384-well plates (approximately 80,000 
cell per well) in the presence of 10"^° M mifepristone and incubated for 48 hours. Cells 
25 were then loaded with the calcium dye fluo-4-AM (Molecular Probes), 3 pM m PBS, 
for 1 .5 hours at room temperature. After replacement with 50 |il PBS, stimulation was 
perfonned at room temperature by the addition of 50 \il PBS supplemented with 
different stimuli. The maximal fluorescence from four independent determinations 
were averaged, corrected for background fluorescence measured prior to compound 
30 addition, and normalized to the response to 0.002 mM ionomycin (CALBIOCHEM). 

Example 21 
Taste Test Protocol for Umami Tastants 
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Basic Training of Sensory Tasters: Tiasters were trained to evaluate the taste of 
aqueous solutions (5 mL each, "swash and spit") of the following standard taste 
compounds by using the triangle test as described in the literature: sucrose (50 mM) for 
sweet taste; citric acid (5 mM) or lactic acid (20 mM) for sour taste; NaCl (12 mM) for 
5 salty taste, quinine (10 jiiM) or caffeine (1 mM) for bitter taste; and monosodium 
glutamate (8 mM) for umami or "savory** taste. 

Training for Umami Taste: Tasters were given 1-3 sets of 6 MSG and/or MSG- 
IMP samples ranging from 3-60 mM MSG and 0-200 ^iM IMP, each arranged in the 
tray in ascending concentration. This exercise gave the subject practice doing dose 
10 response evaluations. Then another set was made up of the same six samples, but were 
given in random order. The subject was then asked to airange the samples in ascending 
intensity and then to rate their umami intensity. 

Qualifying Taste Panelists: Tasters were subjected to a standard two alternative 
forced choice (2AFC) test with 5 pairs of taste samples. They were asked to make a 
15 choice of the most umami sample from two samples (a pair). The test contains two 
easy pairs, two with medium difficulty, and one difficult pair. Tasters who could 
differentiate the medium difficulty pairs were selected as panelists. 

Pilot/Qualitative taste test of Umami Enhancer Candidate (UEC) by a small 
group of panelists: Taste samples of appropriate concentrations (usually 1-50 iiM) 
20 were made in water (use minimum amount of ethanol if not soluble); Taste UEC alone 
at 30 and/or 50 ^M for umami and other attributes. Rate those taste attributes on the 
appropriate Labeled Magnitude Scale (LMS) on the screening ballot; if UEC has 
no/low umami and other tastes, then move forward to discrimination test; compare 
certain concentration of MSG, ag., 12 mM and 12 mM MSG + 30 jiM UEC to 
25 determine if there is any enhancement; rate the perceived umami intensity on the 
appropriate LMS on the screening ballot; vary concentration of UEC and/or MSG to 
find the best combination; decide what solutions to use in panel screening; record all 
procedures and data including description of study, sample prep, sample arrangement, 
ballots and sign up sheet for panelists, data entry and evaluation. 

30 2AFC Panel Screening ofUEC: Run panel screening with qualified panelists 

using protocols generated fix>m the pilot tasting; record all procedures and data; prepare 
summary report with statistically significant conclusions, if any. 
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Example 22 

Quantitative Taste Tests for Compounds 2725761 and 3756807 

Quantitative taste tests for compounds 2725761 and 3756807 were inn 
according to procedures presented above. It was found that both of them have some 
5 enhancement for MSG, in addition to their additive effect of the umami intensity. 

Example 23 
Synthesis of Compounds 2725761 and 3756807 

Compounds 2725761 and 3756807 are prepared as shown in Example 22, from 
10 their corresponding acids and amines. The products are purified by conventional 

methods, basic and acidic aqueous washes, or preparative HPLC. The structures 
of those compounds were confirmed based on usual analytical methods, NMR 
and LCMS. This method can also be used to s3/nthesize any of the compounds found in 
Tables 1-5. 
15 . 

Example 24 
CeU Based Assays 
Cells were grown and maintained at 37°C in Dulbecco's Modified Eagle 
Medium (DMEM) supplemented with 10% FBS and MEM non-essential amino acids 

20 (Gibco BRL); media for Gais cells also contained 3 \ig ml"^ blasticidin (Gibco BRL). 
For calcium-imaging experiments, cells were first seeded onto 48-well tissue-culture 
plates (approximately 30,000 cells per well), and transfected using Minis TransIt-293 
(PanVera). Transfection efficiencies, which were estimated by cotransfection with an 
RFP expression vector, were typically approximately 60%. To minimize glutamate- 

25 induced and glucose-induced desensitization, supplemented DMEM was replaced with 
low-glucose DMEM supplemented with GlutaMAX and 10% dialyzed FBS (Gibco 
BRL) approximately 24 hours after transfection. After an additional 24 hours, cells 
were loaded with the calcium dye fluo-4-AM (Molecular Probes), 3 pM in Dulbecco's 
PBS buffer (DPBS, GibcoBRL), for 1.5 hours at room temperature. After replaces 

30 with 100 nl DPBS, stimulation was performed at room temperature by addition of 100 
|xl DPBS siq)plemented with taste stunuli. Calcium mobilization was monitored on an 
Axiovert S 100 microscope equipped with an mverted lOX/0.5 LWD piano fluor 
objective (Zeiss) and a cooled CCD camera (Princeton Instruments). Fluorescence 
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images were acquired at 480 nm excitation and 535 mn emission, and analyzed with 
Imaging Workbench 4.0 software (Axon Instruments). TIR receptor activity was 
quantitated by counting the number of responding cells 30 seconds after stimulus 
addition. 
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What is claimed: 

1. A non-naturally occurring compound that specifically binds to a T1R2/T1R3 
receptor composed of hTlR2/hTlR3 but not rTlR2/rTlR3. 

2. A non-naturally occurring compound that specifically binds to a T1R2/T1R3 
receptor composed of hTlR2/rTlR3 but not iTlR2/hTlR3. 

3. A non-naturally occurring compoimd that specifically binds to the N-tenninal 
extracellular domain of T1R2 of ttie hTlR2/hTlR3 receptor. 

4. A non-naturally occurring comp<^und that specifically binds to a T1R2/T1R3 
receptor composed of friR2/hTiR3 but not hTlR2/iTlR3. 

5. A non-naturally occurring compound that specifically binds to hTlR2/hTlR3 
and rTlR2/r3-h3 but not to rTlR2/rTlR3 or to hTlR2/h3-r3. 

6. A non-naturally occurring compound that specifically binds to hTlR2/hTlR3 
and r2-h2/rTlR3 but not to rTlR2/rTlR3 or to h2-r2/hTlR3. 

7. A non-naturally occurring compound that specifically binds to the Venus 
Flytrap Domain (VFD) of T1R2 of the hTlR2/hTlR3 and hTlR2/rTlR3 
receptor. 

8. A non-naturally occurring compound that specifically binds to amino acid 
residues 144 and 302 of the human N-terminal Venus flytrap domain of the 
T1R2 subunit of theTlR2/TlR3 receptor. 

9. A non-naturally occurring compound ttmt specifically binds to ±e N-tenninal 
Venus flytrap domain of flie T1R2 subunit of the T1R2/T1R3 receptor, wherein 
the compound is about 12x5x5 angstroms. 
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10. A non^naturally occuiiing compound that specifically binds to ttie cystdne-iich 
region of T1R2 of the hTlR2/hTlR3 receptor. 

1 1 . A non-naturally occurring compound that specifically binds to the 
Transmembrane Domam (TM) of T1R2 of the hTlR2/hTlR3 receptor. 

12. A non-naturally occumng compound that specifically binds to the human N- 
terminal extracellular domain of the T1R3 subunit of the T1R2/T1R3. 

13. A non-naturally occurring compound that specifically binds to the Venus 
Flytrap Domain (VFD) of T1R3 of the hTlR2/hTlR3 receptor. 

14. A non-natuially occurring compound that specifically binds to the 
Transmembrane Domain (TM) of T1R3 of the hTlR2/hTiR3 receptor. 

15. A non-naturally occurring compound that specifically binds to extraceUular 
loop 2 and ^tracellular loop 3 of the human transmembrane domain of the 
T1R3 subunit of T1R2/T1R3. 

16 . The compound of any one of claims 1-15 that demonstrates compound- 
dependent increase in fluorescence with an activity compared to the maximal 
activity for fiixctose of at least 25% in a fluorescence-based assay using a 
FLIPR (Molecular Devices) instrument. 

17. The compound of any of claims 1-16 that demonstrates a compound-dependent 
decrease in the EC50 for a sweetener by at least two-fold in a fluorescence-based 
assay using a FLIPR (Molecular Devices) instrument. 

18. The compound of any one of claims 1-17 tiiat results in at least 10 out of 100 
cells transfected with wild-type or cliimeric receptor showing a compound- 
dependent increase in fluorescence. 
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19. The compound of aay one of claims 1-18 that demonstrates a compound- 
dependent increase of at least two-fold of the nimiber of fluorescent ceUs in 
response to a sub-maximal level of a sweetener. 

20. The compound of any one of claims 1-16 that demonstrates a corrq)ound- 
dependent increase m the response of cells to a sub-maximal level of a 
sweetener of at least 1 .25-fold compared to the response to the sweetener alone. 

21. The compound of claim 20, wherein fhe response is measured by fluorescence, 
calcium levels, IP3 levels, cAMP levels, GTPyS binding, or reporter gene 
activity (e.g. luciferase, beta-galactbsidase), 

22. The compound of any one of claims 1-21, having one or more of the following 
characteristics in a cell: 

decreased EC50 compared to a control of at 50%, 
increased intracellular Ca2+ levels by at least approximately 25%, 
increased intracellular cAMP by at least approxiniately 25%, 
increased intracellular cGMP by at least approximately 25%, 
increased mtracellular IP3 by at least approximately 25%, or 
mcaeased G protein binding of GTPyS by at least approximately 25%. 

23. A chimeric T1R2/T1R3 receptor comprismg, a human T1R2 subunit and a lat 
T1R3 subunit. 

24. A chimeric T1R2/T1R3 receptor comprising, a rat T1R2 subunit and a human 
T1R3 subunit. 

25. A chimeric T1R2 receptor subunit comprising, a human extracellular domain, a 
rat transm^brane domain and a rat intracellular domain. 

26. A chimeric T1R3 receptor subimit comprising, a rat extracellular domain, a 
human transmembrane domain and a hmnan intracellular domain. 
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27. A non-naturally occurring compound that binds to the N-tenninal extracellular 
domain of TlRl of the TlRl/hTlRS receptor. 

28. A non-naturally occurring compound that binds to the TIRIVFD of the 
TIRI/TIRS savory receptor. 

29. A non-naturally occurring compound fliat binds to the cysteine-rich region of 
TlRl of the TlRl/hTlR3 rec<?>tor. 

30. A non-naturaUy occurring compound that binds to the TlRl TM domain of the 
TlRl/riR3 savory receptor. 

3 1 . A non-naturally occurring compound that binds to the N-terminal extracellular 
domain of T1R3 of the TlRl/hTlR3 receptor. 

32. A non-naturally occurring compound that binds to the T1R3 VFD of the 
T1R1/T1R3 savory receptor. 

33. A non-naturally occuning compound that binds to the cysteine-rich region of 
TlR3of1heTlRl/hTlR3 receptor. . , 

34. A non-naturally occuning compound that binds to the T1R3 TM domain of the 
T1R1/T1R3 savory recq)tor. 

35. A non-naturally occurring confound that binds to the TM domain of TlRl of a 
truncated savory receptor conqwsed of the hlTM/h3TM. 

36. A non-naturally occurring compound that binds to ttie TM domain of T1R3 of a 
truncated sweet receptor composed of hlTM/h3TM. 

37. A non-naturally occurring compound that binds to the TM domain of TlRl of a 
chimeric receptor composed of mGluR-hl/mGluR-h3. 
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38. A non-naturally occurring compound that binds to the TM domain of T1R3 of a 
chimeric recq)tor composed of mGluR-*hl/mGluR-h3. 

39. The compound of any one of claims 27-38 tiiat demonstrates compound- 
dependent increase in fluorescence with an activity compared to the maximal 
activity of glutamate of at least 25% in a fluorescence-based assay using a 
FLIPR (Molecular Devices) instrument. 

40. The compound of any one of claims 27-38 that demonstrates a compound- 
dependent decrease in the EC50 for glutamate by at least two-fold in a" 
fluorescence-based assay using a FLIPR (Molecular Devices) instrument. 

41 . The compound of any one of claims 27-38 that results in at least 10 out of 100 
transfected cells showing a compound-dependent increase in fluorescence 
measured with a fluorescent microscope. 

42. The compound of any one of claims 27-38 that results in a compoundr 
dependent increase of at least two-fold of the number of fluorescent cells in 
response to a sub-maximal level of glutamate. 

43. The compound of any one of claims 27-38 that results in a compound- 
dependent increase in the response of cells to a subr-maximal level of glutamate 
of at least 1 .25-fold compared to the response to glutamate alone. 

44. The compound of claim 43, wherein the response is measured by fluorescence, 
calcium levels, IP3 levels, cAMP levels, TTIlyS binding, or reporter gene 
activity. 

45. A method for identifying compounds that modulate taste perception by 
identifying compounds that bind to, activate, inhibit, enhance and/or modulate 
one or more of flie receptors of any one of claims 23-26. 



155 



wo 2005/015158 



PCT/US2004/025459 



46. A method for identifying a compound that modulates sweet taste perception 
comprising comparing the effect of the compound on a sweet receptor to the 
effect of a compound of any one of claims 1-22, an enhancement of sweet 
perception approximately equal to or greater than the sweet enhancement of the 
compound indicating a compound that enhances sweet perception. 

47. A method for identifying a compound that modulates umami taste perception 
comprising comparing the effect of the compound on an umami receptor to the 
effect of a compound of any one of claims 27-44, an enhancement of savory 
perception approximately equal to or greater than the savory enhancement of 
the compound indicating a compound that enhances umami perceptioiL 

48. A method for identifying compoimds that modulate taste perception by 
identifying compounds that bind to, activate, inhibit, and/or modulate a receptor 
expressed by a cell that stably expresses one or more of the receptors of any one 
of claims 23-26. 

49. A method for modulating the savory taste of a comestible or medicinal product 
comprising: 

providing at least one comestible or medicinal product, or a precursor 
thereof, and 

combining the comestible or medicinal product or precursor thereof with at 
least a savory flavor modulating amoimt of at least one non-naturally 
occurring compound of any one of claims 27-44, or a comestibly 
acceptable salt thereof, so as to form a modified comestible or medicinal 
product; 

thereby modulating the savory taste of a comestible or medicinal product. 

50. A method for inhibiting the savory taste of a comestible or medicinal product 
comprising: 

providing at least one comestible or medicinal product, or a precursor 
thereof and 
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combining the comestible or medicinal product or precursor thereof with at 
least a savory flavor inhibiting amount of at least one non-naturally 
occurring compound of any one of claims 27-44, or a comestibly 
acceptable salt thereof, so as to form a modified comestible or medicinal 
product; 

thereby inhibiting the savory taste of a comestible or medicinal product. 

5 1 . A method for increasing the savory taste of a comestible or medicinal product 
comprising: 

providing at least one comestible or medicinal product, or a precursor 
thereof, and 

combining the comestible or medicinal product or precursor thereof with at 
least a savory flavor increasing amount of at least one non-naturally 
occurring compound of any one of claims 27-44, or a comestibly 
acceptable salt thereof, so as to form a modified comestible or medicinal 
product; 

thereby increasing die savory taste of a comestible or medicinal product. 

52. A method for modulating the sweet taste of a comestible or medicinal product 
comprising: 

providing at least one comestible or medicinal product, or a precursor 
thereof, and 

combining the comestible or medicinal product or precursor thereof with at 
least a sweet flavor modulating amount of at least one non-naturally 
occurring compound of any one of claims 1-22 and 60-61, or a comestibly 
acceptable salt thereof, so as to form a modified comestible or medicinal 
product; 

thereby modulating the sweet taste of a comestible or medicinal product. 

53. A method for inhibiting the sweet taste of a comestible or medicinal product 
comprising: 
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providing at least one comestible or medicinal product, or a precursor . 
thereof, and 

combining the comestible or medicinal product or precursor thereof witii at 
least a sweet flavor inhibiting amount of at least one non-naturally 
occurring compound of any one of claims 1-22 and 60-61, or a comestibly 
acceptable salt thereof, so as to form a modified comestible or medicinal 
product; 

thereby inhibiting the sweet taste of a comestible or medicinal product. 

54. A method for increasing the sweet taste of a comestible or medicinal product 
comprising: 

providing at least one comestible or medicinal product, or a precursor 
thereof, and 

combining the comestible or medicinal product or precursor thereof with at 
least a sweet flayor increasing amount of at least one non-naturally 
occurring compound of any one of claims 1-22 or 60-61, or a comestibly 
acGq)table salt thereof, so as to form a modified comestible or medicinal 
product; 

thereby increasing the sweet taste of a comestible or medicinal product. 

55. A method of enhancing umami taste perception comprising contacting an 
umami receptor with cyclamate and NHDC, and their derivatives. 

56. A method of enhancing umami taste perception comprising contacting an 
umami receptor with lactisole derivatives. 

57. A method of enhancing sweet taste perception comprising contacting a sweet 
receptor with cyclamate and NHDC, and their derivatives. 

58. A method of enhancing sweet taste perception comprising contacting a sweet 
receptor with lactisole derivatives. 

59. The compound of any one of claims 27-44, wherem the compound is not 
sucrose, fructose, glucose, erythritol, isomalt, lactitol, mannitol, sorbitol, xylitol, 
certain known natural terpenoids, flavonoids, or protein sweeteners, di-peptides, 
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tri-peptides aspartame, saccharin, sucralose, halogenated saccharides, 
acesulfame-K, cyclamate, sucralose, alitame, monosodium glutamate CMSG"), 
inosine monophosphate (IMP), adenosine monopho^hate, or guanosine 
monophosphate (GMP), 

60. The compound of any one of claims 1-22, wherein the compound is not sucrose, 
fructose, glucose, erythritol, isomalt, lactitol, mannitol, sorbitol, xylitol, certain 
known natural terpenoids, flavonoids, or protein sweeteners, di-peptides, tri- 
peptides, aspartame, saccharin, sucralose, halogenated saccharides, acesulfame- 
K, cyclamate, sucralose, and alitame. neotame, periUartine, SC-45647, SC- 
40014, monellin, NC-002740-01, thaumatin, CC-00100, NC-00420, alitame, 
SC-44102, dulcin, NC-00576, slycyrrhizic Acid, stevioside, Na-Saccharin, Dr 
tryptophan, cyclamate, DHB, glycolic Acid, glycine, D (-)fractose, 
homofuronol, D (-) tagatose, maltose, D (+) glucose, D-sorbitol, D (+) 
galactose, a-lactose, LQfructose, L (+), compound 403249, or glucose. 

61. The compound of any one of claims 1-22, wherein the compound is not sucrose, 
fructose, glucose, erythritol, isomalt, lactitol, mannitol, sorbitol, xylitol, certain 
known natural terpenoids, flavonoids, or protein sweeteners, di-peptides, tri- 
peptides, aspartame, saccharin, sucralose, halogenated saccharides, acesulfame-. 
K, cyclamate, sucralose, and alitame. neotame, periUartine, SC-45647, SC- 
40014, monellin, NC-002740-01, thaumatin, CC-00100, NC-00420, alitame, 
SC-44102, dulcin, NC-00576, slycyrrhizic Acid, stevioside, Na-Sacchaiin, D- 
tryptophan, cyclamate, DHB, glycolic Acid, glycine, D (-)fructose, 
homofuronol, D (-) tagatose, maltose, D (+) glucose, D-sorbitol, D (+) 
galactose, a-lactose, LQfructose, L (+), compound 403249, glucose, or 
Compound 6364395. 

62. The compound of any one of claims 27-44, wherein the compound is not 
sucrose, fructose, glucose, erythritol, isomalt, lactitol, mannitol, sorbitol, xylitol, 
certain known natural terpenoids, flavonoids, or protein sweeteners, di-peptides, 
tri-peptides aspartame, saccharin, sucralose, halogenated saccharides, 
acesulfame-K, cyclamate, sucralose, alitame, monosodium glutamate CTMSG"), 
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inosine monophosphate (IMP), adenosine monophosphate, or Compound 
6364395, guanosine monophosphate (GMP). 
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Figure 1: Catalog of human and rat i iks 
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Figure 2 hT1R2 and hT1R3 are expressed in liuman tongue 
epithelium. cDNA-specific amplification products can be 
amplified from cDNA prepared from resected human 
circumvailate papillae. 
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Figure 3 Human T1R2/T1R3 functions as a sweet taste 
receptor 
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Figure 4 T1 R2 may control T1 R2/T1 R3 ligand specificity 
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Figure 5 
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Figure 7 Key ligand^blnding residues of mGlurRI are 
conserved in T1R1 
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Figure 8 Human T1R1/T1R3 functions as an umami taste 
receptor 
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Figure 9 PDZIP facilitate the surface expression of human 
T1R2. 

A. Immunofluorescence staining of Myc-tagged hT1R2 
indicates that PDZIP significantly increases the 
amount of human T1 R2 protein on the plasma 
membrane. 

B. FACS analysis data demonstrating the same result. 

Myc-tagged human T1R2: Green line. Myc- 
tagged 



C. 



human T1R2 with PDZIP: black line. 
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Figure 10 CalciumHmaging data demonstrating hT1R2/hT1R3 
responses to a number of sweet stimuli. 
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Figure 11 



hT1R1 * hT1R3, constant IMP (O.SmM) 
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Figure 1 6 Lactlsole inhibits th© T1R2/riR3 sweet and T1R1/T1R3 umaml receptors and 
sweet and umaml taste. {Lefi panel) responses of HEK-Gois cells transiently transfected with 
T1R1/riR3 {circles) to 10 mM L-glutamate and HEK-GeiB cells transiently transfected with 
T1R2/T1R3 {squares) to 150 mM sucrbse in the presence of variable concentrations of lactlsole 
are shown. (Right panel) fold Increases In taste detection thresholds In the presence of 1 and 2 mM 
lactlsole are shown for the sweet taste stimuli sucrose and D-tryptophan, the umaml taste stimun L- 
glutamate (MSG) and L-glutamate plus 02 mM IMP, and sodium chloride. Detection thresholds 
were detenmined following the method of Schfflman et al. 
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Figure 18 
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Figure 19 
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Figure 20 
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Figure 21 A 

Sucrose 
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Figure 21B 
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SEQUENCE LISTING 

<110> Senomyx, Inc. 

<120> TIR HETBRO-OIiIGOMERIC TASTE RECEPTORS, 

CELIj IiINES that express said RECEPTORS; AND TASTE COMPOUNDS 



<130> 19328. OOOlul 

<140> Unassigned 
<141> 2004-08-06 

<150> 60/494,071 
<151> 2003-08-06 

<150> 60/552,064 
<151> 2004-03-09 

<160> 33 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 5 
<212> PRT 

<2 13 > Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; note » 
synthetic construct 

<400> 1 

Ser Val Ser Thr Trp 
1 5 

<210> 2 
<211> 14 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; note ■= 
synthetic construct 

<221> VARIANT 

<222> 1,3,4,6,7,8,11,12,13 

<223> At position 1, Xaa can be either Thr or Arg 
At position 3, Xaa can be either Phe or Leu 
At position 4, Xaa can be either Arg, Gin, or Pro 
At position 6, Xaa can be either Arg or Thr 
At position 7, Xaa can be either Ser, Pro or Val 
At position 8, Xaa can be either Val, Glu, Arg, 
Lys or Thr 

At position 11, Xaa can be either Ala or Glu 
At position 12 , Xaa can be either Trp or Leu 
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At position 13, Xaa can be either Arg, His or Gly 
<400> 2 

Xaa Cys Xaa Xaa Arg Xaa Xaa Xaa Fhe Leu Xaa Xaa Xaa Glu 
15 10 

<210> 3 
<211> 15. 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; note = 
synthetic construct 

<221> VARIANT 

<222> 1,3,4,7,9,10,11,13,14,15 

<223> At position 1, Xaa can be either Leu or Gin 

At position 3, Xaa can be either Glu, Gly or Thr 
At position 4, Xaa can be either Asn, Arg or Cys 
At position 7, Xaa can be either Arg or Glu 
At position 9, Xaa can be either Arg or Lys 
At position 10, Xaa can be either Cys, Gly or Phe 
At position 11, Xaa can be either Val, Leu or lie 
At position 13,. Xaa can be either Phe or Leu 
At position 14, Xaa can be either Ala or Ser 
At position 15, Xaa can be either Met or Leu 

<400> 3 

Xaa Pro Xaa Xaa Tyr Asn Xaa Ala Xaa Xaa Xaa Thr Xaa Xaa Xaa 
1 5 10 15 

<210> 4 
<211> 858 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; note = 
synthetic construct 



<400> 4 



Met 


Pro 


Gly 


Leu 


Ala 


lie 


Leu 


Gly 


Leu 


Ser 


Leu 


Ala 


Ala Phe Leu Glu 


1 






5 










10 






15 


Leu 


Gly 


Met 


Gly 


Ser 


Ser 


Leu 


Cys 


Leu 


Ser 


Gin 


Gin 


Phe Lys Ala Gin 








20 










25 








30 


Gly 


Asp 


Tyr 


lie 


Leu 


Gly 


Gly 


Leu 


Phe 


Pro 


Leu 


Gly 


Thr Thr Glu Glu 






35 










40 










45 


Ala 


Thr 


Leu 


Asn 


Gin 


Arg 


Thr 


Gin 


Pro 


Asn 


Gly 


He 


Leu Cys Thr Arg 




50 










55 










60 




Phe 


Ser 


Pro 


Leu 


Gly 


Leu 


Phe 


Leu 


Ala 


Met 


Ala 


Met 


Lys Met Ala Val 


65 










70 










75 




80 


Glu 


Glu 


lie 


Asn 


Asn 


Gly 


Ser 


Ala 


Leu 


Leu 


Pro 


Gly 


Leu Arg Leu Gly 










85 










90 






95 


Tyr 


Asp 


Leu 


Phe 


Asp 


Thr 


Cys 


Ser 


Glu 


Pro 


val 


Val 


Thr Met Lys Pro 




100 










105 








110 


Ser 


Leu 


Met 


Phe 


Met 


Ala 


Lys 


Val 


Gly 


Ser 


Gin 


Ser 


He Ala Ala Tyr 



2 
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115 








120 


Cys 


Asn 


Tyr 


Thr 


Gin 


Tyr 


Gin Pro 




130 










135 


His 


Ser 


Ser 


Glu 


Leu 


Ala 


Leu lie 


145 










150 




Leu 


Met 


Pro 


Gin 


Val 


Ser 


Tyr Ser 










165 






Arg 


Glu 


Thr 


Phe 


Pro 


Ser 


Phe Phe 








180 








6ln 


Leu 


Gin 


Ala 


Val 


Val 


Thr Leu 






195 








200 


Val 


Ala 


Ala 


Leu 


Gly 


Ser 


Asp Asp 




210 










215 


lie 


Phe 


Ser 


Gly 


Leu 


Ala 


Asn Ser 


225 










230 




Gly 


Leu 


Val 


Pro 


Gin 


His 


Asp Thr 










245 






Val 


Asp 


Val 


Leu 


Arg 


Gin 


Val Asn 








260 








Leu 


Phe 


Ala 


Ser 


Ala 


Arg 


Ala Val 






275 








280 


Leu 


His 


Asp 


Leu 


Ser 


Pro 


Lys Val 




290 










295 


Thr 


Ser 


Asp 


Leu 


Val 


Met 


Thr Leu-* 


305 










310 




Val 


Leu 


Gly 


Phe 


Leu 


Gin 


Arg Gly 










325 






Tyr 


Val 


Glu 


Thr 


Arg 


Leu 


Ala Leu 








340 








Ser 


Leu 


Lys 


Ala 


Glu 


Leu 


Asp Leu 






355 








360 


Cys 


Ser 


Gin 


Cys 


Asp 


Tyr 


lie Met 




370 










375 


Met 


Gin 


Asn 


Leu 


Ser 


Ala 


Gly Gin 


385 










390 




Tyr 


Ala 


Ala 


Val 


Tyr 


Ser 


Val Ala 










405 






Cys 


Asn 


Val 


Ser 


His 


Cys 


His Thr 








420 






Leu 


Leu 


Glu 


Asn 


Met 


Tyr 


Asn Met 






435 








440 


Leu 


Gin 


Phe 


Asp 


Ala 


Lys 


Gly Ser 




450 










455 


Met 


Trp 


Val 


Trp 


Gin 


Ser 


Pro Thr 


465 










470 




Phe 


Asn 


Gly 


Thr 


Leu 


Gin 


Leu Gin 










485 






Asn 


Gin 


Val 


Pro 


Val 


Ser 


Gin Cys 








500 








Val 


Arg 


Arg 


Val 


Lys 


Gly 


Phe His 






515 








520 


Cys 


Lys 


Ala 


Gly 


Ser 


Tyr 


Arg Lys 




530 










535 


Pro 


Cys 


Gly 


Lys 


Asp 


Gin 


Trp Ser 


545 










550 




Pro 


Arg 


Arg 


Pro 


Lys 


Phe 


Leu Ala 



565 



125 



Arg 


Val Leu Ala 


Val lie Gly Pro 




140 




Thr 


Gly Lys Phe 


Phe Ser Phe Phe 




155 


16.0 


Ala 


Ser Met Asp 


Arg Leu Ser Asp 




170 


175 


Arg 


Thr Val Pro 


Ser Asp Arg Val 


185 




190 


Leu 


Gin Asn Phe 


Ser Trp Asn Trp 






205 


Asp 


Tyr Gly Arg 


Glu Gly Leu Ser 




220 




Arg 


Gly lie Cys 


lie Ala His Glu 




235 


240 


Ser 


Gly Gin Gin 


Leu Gly Lys Val 




250 


255 


Gin 


Ser Lys Val 


Gin Val Val Val 


265 




270 


Tyr 


Ser Leu Phe 


Ser Tyr Ser lie 






285 


Trp 


Val Ala Ser 


Glu Ser Trp Leu 




300 


Arg Val Gly 'Thr 


Pro 


Asn lie Ala 




315 


320 


Ala 


Leu Leu Pro 


Glu Phe Ser His 




330 


335 


Ala 


Ala Asp Pro 


Thr Phe Cys Ala 


345 




350 


Glu 


Glu Arg Val 


Met Gly Pro Arg 






365 


Leu 


Gin Asn Leu 


Ser Ser Gly Leu 




380 




Leu 


His His Gin 


lie Phe Ala Thr 




395 


400 


Gin 


Ala Leu His 


Asn Thr Leu Gin 




410 


415 


Ser 


Glu Pro Val 


Gin Pro Trp Gin 


425 




430 


Ser 


Phe Arg Ala 


Arg Asp Leu Thr 






445 


Val 


Asp Met Glu 


Tyr Asp Leu Lys 




460 




Pro 


Val Leu His 


Thr Val Gly Thr 




475 


480 


His 


Ser Lys Met 


Tyr Trp Pro Gly 




490 


495 


Ser 


Arg Gin Cys 


Lys Asp Gly Gin 


505 




510 


Ser 


Cys Cys Tyr 


Asp Cys Val Asp 






525 


His 


Pro Asp Asp 


Phe Thr Cys Thr 




540 




Pro 


Glu Lys Ser 


Thr Thr Cys Leu 




555 


. 560 


Trp 


Gly Glu Pro 


Ala Val Leu Ser 




570 


575 
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Xi6U 




Leu Leu Cys Leu 


Val 








580 












irxixs 


Val 


His Tyr Trp 


Asp 






595 








600 


Glv 


Ser 


Leu 


Phe 


Cys Phe 


Gly 


Leu 




610 








615 




Sex 


Val 


Leu 


Leu Phe Pro Gly 


Ara 


625 








630 






Gin 


Qln 


Pro 


Met 


Ala His 


Leu 


Pro 










645 






Phe 


IjCU 


Gin 


Ala Ala Glu 


lie 


Phe 








660 










Ala 


Ash 


Trp Leu Cys 


Ser 








675 








680 


Val 


Val 


Leu 


Leu Ala Thr 


Leu 


Val 




690 








695 




Leu. 


Met 


Ala 


Phe 


Pro Pro 


Glu 


Val 


705 








710 






Thr* 

X JUlJL 




Val 


Leu 


Glu His 


Cys 












725 








veix 


His 


lie 


Thr Asn Ala 


Val 








740 










JrilC? 




Val 


Gin Ser 


Gin 


Pro 






755 








760 


IjGU 


Tilt* 




Ala 


Met Leu 


Ala 






770 








775 




Piro 


ucu . 


. JJCU 


Ala 


Asn Val 


Gin 


Val 


785 








790 






vjr J. jr 


Ala 


lie 


Leu 


Phe Cys 


Ala 


Leu 










805 






Pro 


Lys 


Cys 


Tyr Val Leu Leu 


Trp 








820 








Phe 


Phe 


Leu 


Gly Arg Ser Pro 


Lys 






835 








840 


Ser 


Ser 


Glu 


Ala Thr Arg Gly 


His 




850. 








855 





<210> 5 
<211> 841 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial 



synthetic 


construct 




<400> 5 






Met Leu Leu Cys 


Thr Ala Arg 


Leu 


1 


5 




Cys Cys Trp Ala 


Phe Ala Cys 


His 


20 






Thr Leu Pro Gly 


Asp Tyr Leu 


Leu 


35 




40 


Gly Cys Leu Gin 


Val Arg His 


Arg 


50 


55 




Ser Cys Ser Phe 


Asn Glu His 


Gly 



65 70 



Leu 


Gly 


Leu 


Thr 


Leu 


Ala 


Ala 


Leu 


585 










590 






Ser 


Pro 


Leu 


Val 


Gin 


Ala 


Ser 


Gly 










605 








He 


Cys 


Leu 


Gly 


Leu 


Phe 


Cys 


Leu 








620 










Pro 


Arg 


Ser 


Ala 


Ser 


Cys 


Leu 


Ala 






635 










640 


Leu 


Thr 


Gly 


Cys 


Leu 


Ser 


Thr 


Leu 




650 










655 




Val 


Glu 


Ser 


Glu 


Leu 


Pro 


Leu 


Ser 


665 










670 






Leu 


Arg 


Gly 


Pro 


Trp 


Ala 


Trp 


Leu 










685. 








Glu 


Ala 


Ala 


Leu 


Cys 


Ala 


Trp 


Tyr 








700 










Val 


Thr 


Asp 


Trp 


Gin 


Val 


Leu 


Pro 






715 










720 


Met 


Arg 


Ser 


Trp 


Val 


Ser 


Leu 


Gly 




730 










735 




Leu 


Ala 


Phe 


Leu 


Cys 


Phe 


Leu 


Gly 


745 










750 






Gly 

•* 


Arg 


Tyr 


Asn 


Arg 


Ala 


Arg 


Gly 










765 








Phe 


He 


He 


Trp 


Val 


Ser 


Phe 


Val 








780 










Ala 


Tyr 


Gin 


Pro 


Ala 


Val 


Gin 


Met 






795 










800 


Gly 


He 


Leu 


Ala 


Thr 


Phe 


His 


Leu 




810 










815 




Leu 


Pro 


Glu 


Leu 


Asn 


Thr 


Gin 


Glu 


825 










830 






Glu 


Ala 


Ser 


Asp 


Gly 


Asn 


Ser 


Gly 



845 

Ser Glu 



Sequence; note » 



Val 


Gly Leu 
10 


Gin 


Leu 


Leu 


He 
15 


Ser 


Ser 


Thr Glu 


Ser 


Ser 


Pro 


Asp 


Phe 


25 








30 






Ala 


Gly Leu 


Phe 


Pro 
45 


Leu 


His 


Ser 


Pro 


Glu Val 


Thr 
60 


Leu 


Cys 


Asp 


Arg 


Tyr 


His Leu 
75 


Phe 


Gin 


Ala 


Met 


Arg 
80 
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Leu Gly 


Val 


Glu 


Glu 


He 


Asn Asn Ser 


Thr 


Ala Leu Leu Pro Asn He 










85 






90 


. 95 


Thr 


Leu 


Gly 


Tyr 


Gin 


Leu 


Tyr Asp Val 


Cys 


Ser Asp Ser . Ala Asn Val 








100 






105 




110 


Tyr Ala 


Thr 


Leu 


Ara 


Val 


Leu Ser Leu 


Pro 


Gly Gin His His He Glu 






115 








120 




125 


Leu 


Gin 


Glv 


Asp 


Leu 


Leu 


His Tvr Ser 


Pro 


Thr Val Leu Ala Val He 




130 










135 




140 


Gly Pro 


ASD 


Ser 


Thr 


Asn 


Arcf Ala Ala 


Thr 


Thr Ala Ala Leu Leu Ser 


145 










150 






155 160 


Pro 


Phe 


Leu 


Val 


Pro 


Met 


Tie Ser Tvr 


Ala 


Ala Ser Ser Glu Thr Leu 

fVXGL OCX WCX wXU J. IXX UCU 










165 






170 


175 


Ser 


Val 


Lys 




Gin 


Tvr 

±/x 


■Pro Ser Phe 


Leu 


Arcr Thr Tie Pro Asn Asn 








180 






185 




190 


Lys 


Tyr 


Gin 


Val 


Glu 


Thr 


M^h Val TiPii 

I*iCU> .VCIX xscu 


Leu 


Tii&ii Gl n T.vca PVi** (Til vr Tvt^ 
xicu V9XU xjjr D xriic oxy xx^ 






195 








200 




205 


Thr 


Trp 




Ser 




Val 

V GLX 


Glv .Q^T* Got* 

wXjf OCX OCX 




ZVrti TH/r Glv Gin Tiian fll vr 

XjrX wXjr V7XX1 XJCU wXjr 




210 










215 




220 


Val 


Gin 


III a 




wXU 




m n 2il SI TTtk 

OXU nXcL XlXL 


Glv 


Gin Glv Tl*» Pva Tl*» JVl sa 
\jxxx \3JLy xxc \mya xxc AXd 


225 










230 

A J V 








Phe. 


Lys 


Asp 






JrXU 


IrtVS OCX AXa 


Gl n 


T7a 1 rtl V AsT^ Alii 2i>"rT Mof* 
VaX \3Xj OXU Ax^ ITlcb 










245 






250 


255 


Gin Cys 


Leu 


Met 




His 


Ti^ii 2Vl a Gl Tl 


Ala 


Glv Ala Thr Val Val Val 
wxjr nxa xxxx vcxx vax ve&x 








260 






265 




270 


Val Phe 


Ser 


Ser 


AXy 


Gin 


Leu. A.1 a Ara 

JUSwU <XXGi Vj 


Val 


Phe Phe Glu Ser Val Val 

f^xxc c^xxw V7X u OCX V ci>x vox 






275 








280 




285 


Leu 


Thr 


Asn 


Leu 


Thr 


Glv 

yxy 


Lve Val Tm 


Val 


TLla Ser Glu Ala Tro Ala 




290 










295 




300 


Leu 


Ser 




His 


He 


Thr 


Glv Val Pro 

V7Xjr V u J. JCrXW 


Glv 
oxy 


Tie Gin Ara Tie Glv Met 

xxc wXXX AX^ XXC wXy 1 i V? u« 


305 










310 






315 320 

•J X «J Jaw 


Val 


Leu 


V3xy 


Val 


Ala 


He 


Gin Tjva Arcr 


Ala 


Val Pro Glv Tjeii TiVS Al a 

vox XrXw wXy UCU Xiys fVXCi. 










325 






330 


335 


Phe 


Glu 


Glu 


Ala 


xyt 


Ala 


Airof Ala Aan 
^&x ^>x a £%D^ 


Tivs 
xiyo 


XiVS Ala Pro Arcr Pro Pvs 








340 






345 




350 


His 


Lys 


Glv 


Ser 


xxp 


Cys 


S&TT Sgy Asn 

0CX 0CX £%0^ 


Gin 


Leu Cvs Ara Olti Cvs Qln 

XJCU WjrB <%X^ wXU ^ O WXXI 






355 








360 




365 


Ala 


Phe 


Net 


Ala 


His 


Thr 


t Pro XjVS 


Leu 


Lvs Ala Phe Ser Met* Ser 

Xlj^B ^^XGL fXXG OCX X'XC b« OCX 




370 










375 




380 


Ser 


Ala 


Tyr 


Asn 


Ala 


Tyr 


Arg Ala Val 


Tyr 


Ala Val Ala His Gly Leu 


385 










390 






395 400 


His 


Gin 


Leu 


Leu 


Glv 


Cys 


Ala Pi*»r Glv 

exjk^cL V7^y 


Ala 


CV'S Ser Ara Glv Ara Val 


















XD 


Tyr 


Pro 




rtl n 

wXXA 




UC UL 


Gill Gin TIa 

^Xti wXXl xxc 


His 


TiVR Val TTi s Phe- Tieii Tieii 

Xlj^D vax XxXD XrXXC iJCU ucu 








420 






425 




430 

V V 


His 


Lys 


Asp 


Thr 


Val 


Ala 


Phe Asn Asp 


Asn 


Arg Asp Pro Leu Ser Ser 






435 








440 




445 


Tyr 


Asn 


lie 


He 


Ala 


ixp 


Asn Tni Asn 

xx^ <^DXx 


Glv 


Pro Iiva Tm Thr Phe Thr • 
XT X w xjjr o xx^ xxxx fxxc xxxx 




450 










455 




460 


Val 


Leu 




OCX 


OCX 


xux 


Ti^n Qat* Oi^n 
Xx^ OCX 4rXw 


Val 

VCIX 


Gin TiAiY Afiin Tlo Asn /3lii 

wXXJl XICU nOXl xxc AoU \JXU 


465 










470 






475 480 


Thr 


Lys 


He 


Gin 


Trp 


His 


Gly Lys Asp 


Asn 


Gin Val Pro Lys Ser Val 










485 






490 


495 


Cys 


Ser 


Ser 


Asp 


Cys 


Leu 


Glu Gly His 


Gin 


Arg Val Val Thr Gly Phe 








500 






505 




510 


His 


His 


Cys 


Cys 


Phe 


Glu 


Cys Val Pro 


Cys 


Gly Ala Gly Thr Phe Leu 






515 








520 




525 
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Asn Iiys Ser Asp Leu Tyr Arg Cys Gin Pro Cys Gly Lys Glu Glu Trp 

530 535 540 

Ala Pro Glu Gly Ser Gin Thr Cys Phe Pro Arg Thr Val Val Phe Leu 
545 550 555 560 

Ala Leu Arg Glu His Thr Ser Trp Val Leu Leu Ala Ala Asn Thr Leu 

565 570 575 

Leu Leu Leu Leu Leu Leu Gly Thr Ala Gly Leu Phe Ala Trp His Leu 

580 585 590 

Asp Thr Pro Val Val Arg Ser Ala Gly Gly Arg Leu Cys Phe Leu Met 

595 600 605 

Leu Gly Ser Leu Ala Ala Gly Ser Gly Ser Leu Tyr Gly Phe Phe Gly 

610 615 620 

Glu Pro Thr Arg Pro Ala Cys Leu Leu Arg Gin Ala Leu Phe Ala Leu 
625 630 635 640 

Gly Phe Thr lie Phe Leu Ser Cys Leu Thr Val Arg Ser Phe Gin Leu 

645 650 655 

lie lie He Phe Lys Phe Ser Thr Lys Val Pro Thr Phe Tyr His Ala 

660 665 670 

Trp Val Gin Asn His Gly Ala Gly Leu Phe Val Met He Ser Ser Ala 

675 680 685 

Ala Gin Leu Leu He Cys Leu Thr Tzp Leu Val Val Trp Thr Pro Leu 

690 695 700 

Pro Ala Arg Glu Tyr Gin Arg Phe Pro His Leu Val Met Leu Glu Cys 
705 710 715 720 

Thr Glu Thr Asn Ser Leu Gly Phe He Leu Ala Phe Leu Tyr Asn Gly 

725 730 735 

Leu Leu Ser He Ser Ala Phe Ala Cys Ser Tyr Leu Gly Lys Asp Leu 

740 745 750 

Pro Glu Asn Tyr Asn Glu Ala Lys Cys Val Thr Phe Ser Leu Leu Phe 

755 760 765 

Asn Phe Val Ser Trp He Ala Phe Phe Thr Thr Ala Ser Val Tyr Asp 

770 775 780 

Gly Lys Tyr Leu Pro Ala Ala Asn Met Met Ala Gly Leu. Ser Ser Leu 
785 790 795 800 

Ser Ser Gly Phe Gly Gly Tyr Phe Leu Pro Lys Cys Tyr Val He Leu 

805 810 815 

Cys Arg Pro Asp Leu Asn Ser Thr Glu His Phe Gin Ala Ser He Gin 

820 825 830 

Asp Tyr Thr Arg Arg Cys Gly Ser Thr 
835 840 

<210> 6 
<211> 839 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; note » 
synthetic construct 

<400> 6 

Met Gly Pro Arg Ala Lys Thr He Cys Ser Leu Phe Phe Leu Leu Trp 

15 10 15 

Val Leu Ala Glu Pro Ala Glu Asn Ser Asp Phe Tyr Leu Pro Gly Asp 

20 25 30 

Tyr Leu Leu Gly Gly Leu Phe Ser Leu His Ala Asn Met Lys Gly He 
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35 










40 


Val 


Hie 


Leu 


Asn 


Phe 


Leu Gin 


Val 




50 










55 




Dys 


Val 

V CLJm 


He 




xyx 


Asn 


Leu 


Met 


65 










70 








Tl A 


Ti an 






Ser 


Ser 


Leu 










85 










Tip 


Val 




Val 


Cys 


Tyr 


He 








inn 

X w u 










Tyr 


"□Via 


Leu 


Hi a 
AJLa 


AXB 


Glu Asp 


7\ CST1 
noli 
















120 






Tyr 


Tl <a 
xxc 


OCX 


Arg Val 


Val 
V ax 














135 








Val 




XX JUL. 


Val 


Ala 


Asn 


1 4.5 
Xrt -J 










150 








fl A 






OCX 


Ala 


He 


Sat* 

OCX 










165 










FlTO 


Axa 


Leu 


Leu 


Arg Thr 


XllX 




















neu 


vax 


r*1 n 

oxn 


Leu 


nec 


Leu 


IlXB 
















9 on 


Leu 


vax 


Ser 


Ser 


Asp 


Thr 


Tyr 
















215 




urJ.U 


Arg 


Vax 


a! a 
AX a 


Arg 


Arg 


Asp 


Tl A 

xxts 












o o rt 










TiAll 
JJCU 


Gin 


JrX v 


Asn 


Gin 


Asn 










245 








Lj6U 


vax 




He 


Val 


ASp 


Lys 


Leu 








« O V 










va.x 


va± 


Jriie 


Ser 


Fro 


TV MM 

Asp 


Leu 


xuxr 
















280 


Leu 


Arg 




Ash 


DVia 

xrlie 


Thr 


Gly 


2il a 
nxa 




O Qft 










295 




TV 1 »s 


xie 


Asp 


Pro 


vax 


Leu 


His 


nSn 












310 








irxie 


Leu 


oxy 


Tl A 

xxe 


Thr 


He 


ni n 


















V3rXU 


OlA 


Arg 


nl 11 


irp 


Gly 


Pro 


Gin 








340 










TJir 


seiT 


n1 Tl 


Ser 


Tyr 


Thr 


Cys 


TV cm 






ICC 












AX a 


TiUT 


Leu 


Ser 


iriie 


Asn 


Thr 


Tl A 
XXc 




^•7ft 










375 




vax 


Tyr 


Ser 


Val 

vax 


Tyr 


Ser 


Ala 


Val 
VaX 


4 O 9 










390 






Ser 


Leu 


Leu 




Cys 


Asp 


Lys 


Ser 










405 








Piro 


Trp 


uxn 


Leu 


Leu 


Glu 


Glu 


Tl *a 

xxts 








420 










Asp 


His 


Gin 


He 


Phe 


Phe 


Asp 


Pro 






435 










440 


Glu 


He 


val 


Gin 


Trp 


Gin 


Trp 


Asp 




450 










455 




Val 


Ala 


Ser 


Tyr 


Tyr 


Pro 


Leu 


Gin 


465 










470 






He 


Ser 


Trp 


His 


Thr 


Val 


Asn 


Asn 



485 



45 



Pro 


Met 


Cys Lys Glu 


Tyr Glu 


Val 






60 






Gin 


Ala 


Met Arg Phe 


Ala Val 


Glu 






75 




80 


Leu 


Pro 


Gly Val Leu 


Leu Gly 


Tyr 




90 




95 




Ser 


Asn 


Asn Val Gin 


Pro Val 


Leu 


105 






110 




Leu 


Leu 


Pro Tie Gin 

C^lm\tJ ak^^ WabJU 


Glu AsD 


Tvr 






125 






Ala 


Val 


He Gly Pro 


Asp Asn 


Ser 






140 






Phe 


Leu 


Ser Leu Phe 


Leu Leu 


Pro 






155 




160 


Asp 


Glu 


Leu Aircf Asn 


Lys Val 


Ara 




170 




175 




Pro 


Ser 


A.la Asn His 


His Val 


Glu 


XO 3 






190 




JTllC 


TV vrr 


Ttt^ 2Van Ttt^ 
X X ^ rioAi X X j>i 


He He 


Val 
















A^n Glv Gin 

j^oxx \j^xx 


Leu Leu 


Glv 






220 






Cys 


He 


Ala Phe Gin 


Glu Thr 


Leu 






235 




240 


Met 


Thr 


Ser Glu Glu 


Arg Gin 


Arcr 




250 




255 




Gin 


Gin 


Ser Thr Ala 


Arg Val 


Val 


265 






270 




Leu 


xyx 


His Phe Phe 


Asn Glu 


Val 






285 






Val 


xxp 


lie Ala Ser 


Glu Ser 

w& ^^^^^ 


Trt) 






300 






Leu 


Thr 


Glu Leu Gly 


His Leu 


Glv 






'^i *^ 

X<7 




320 


Ser 


Val 


Pro He Pro 

f^^W C^^W 


Gly Phe 


Ser 




330 




335 




Ala 


Glv 


Pro Pro Pro 


Leu Ser 


Ara 


345 






350 




Gin 


Glu 


C?va Asn Asii 


Cva Leu 


Asn 












Leu 


Axy 


■ Leu Ser Qlv 


Glu Arg 


Val 






^ o w 








Ala 


Val Ala His 


Ala Leu 


His 










400 


Thr 


Cys 


Thr Lva Arcr 


Val Val 


Tvr 




41 n 

^ xu 




415 




ixp 


Lys 


Val Asn Phe 


Thr Leu 


Leii 








430 




Gin 


Gly 


Asp Val Ala 


Leu His 


Leu 






445 






Arg 


Ser 


Gin Asn Pro 


Phe Gin 


Ser 






460 






Arg 


Gin 


Leu Lys Asn 


He Gin 


Asp 






475 




480 


Thr 


He 


Pro Met Ser 


Met Cys 


Ser 




490 




495 





7 



wo 2005/015158 



PCT/US2004/025459 



I»ys 


Arg 


Cys 


Gin 


Ser 


Gly 


Gin 


Lys 








500 










Cys 


Cys 


Phe 


Glu 


Cys 


lie 


Asp 


Cys 






515 










520 


Thr 


Glu 


Asp 


Glu 


Tyr 


Glu 


Cys 


Gin 




530 










535 




Tyr 


Gin 


Ser 


Glu 


Thr 


Ser 


Cys 


Pne 


545 










550 






Trp 


His 


Glu 


Ala 


Pro 


Thr 


He 


Ala 










565 








Phe 


Leu 


Ser 


Thr 


Leu 


Ala 


He 


Leu 








580 










Thr 


Pro 


lie 


Val 


Arg 


Ser 


Ala 


Gly 






595 










600 


Thr 


Leu 


Leu 


Leu 


Val 


Ala 


Tyr 


Met 




610 










615 




Pro 


Lys 


Val 


Ser 


Thr 


Cys 


Leu 


Cys 


625 










630 






Phe 


Thr 


lie 


Cys 


lie 


Ser 


Cys 


He 










645 








Cys 


Ala 


Phe 


Lys 


Met 


Ala 


Ser 


Arg 








660 










Val 


Arg 


Tyr 


Gin 


Gly 


Pro 


Tyr 


Val 






675 










680 


Lys 


Met 


Val 


lie 


Val 


Val 


He 


Gly 




690 










695 




Thr 


Thr 


Arg 


Thr 


Asp 


Pro 


Asp 


Asp 


705 










710 






Asn 


Pro 


Asn 


Tyr 


Arg 


Asn 


Ser 


Leu 










725 








Leu 


Leu 


Ser 


Val 


Val 


Gly 


Phe 


Ser 








740 










Pro 


Thr 


Asn 


Tyr 


Asn 


Glu 


Ala 


Lys 






755 










760 


Tyr 


Phe 


Thr 


Ser 


Ser 


Val 


Ser 


Leu 




770 










775 




Gly 


Val 


Leu 


Val 


Thr 


He 


Val 


Asp 


785 










790 






Leu 


Ala 


lie 


Ser 


Leu 


Gly 


Tyr 


Phe 










805 








Phe 


Tyr 


Pro 


Glu 


Arg 


Asn 


Thr 


Pro 








820 










Gly 


Tyr 


Thr 


Met 


Arg 


Arg 


Asp 





835 



Lys 


Lys 


Pro 


Val 


Gly He 


His 


Val 


505 








510 






Leu 


Pro 


Gly 


Thr 


pne lieu 


Asn 


TJ J Q 

nlS 










525 


• 




At 9 

iijia 


Cys 


Pro 


Asn 


Asn ijxu 


Trp 


Ser 








540 








Lys 


Arg 


uin 


Leu 


vai irne 


Leu 


V7lU 






555 








560 


Val 


Ala 


Leu 


Leu 


Ala Ala 


Leu 


Gly 




Kin 
5 / u 








5 /b 




Val 


He 


Pne 


Trp 


Arg His 


Pne 


Gin 


c o e 








con 
590 






Gly 


Pro 


Met 


Cys 


Phe Leu 


Met 


Leu 










d05 






Val 


Val 


Pro 


Val 


Tyr Val 


Gly 


Pro 








620 








Arg 


Gin 


Ala 


Leu 


Phe Pro 


Leu 


Cys 






635 








640 


Ala 


Val 


Arg 


Ser 


Phe Gin 


He 


Val 




650 








o5b 




Phe 


Pro 


Arg 


Ala 


Tyr Ser 


Tyr 


Trp 


665 








o7U 






Ser 


Met 


Ala 


Pne 


He Thr 


Val 


Leu 










f oc - 
oo5 






Met 


Leu 


Ala 


Thr 


Gly Leu 


Ser 


Pro 








700 








Pro 


Lys 


He 


Thr 


He Val 


Ser 


Cys 






715 








720 


Leu 


Phe 


Asn 


Thr 


Ser Leu 


Asp 


Leu 




730 








735 




Phe 


Ala 


Tyr 


Met 


Gly Lys 


Glu 


Leu 


n A c 








•7 c n 






Phe 


He 


Thr 


Leu 


Ser Met 


Thr 


Phe 










765 






Cys 


Thr 


Phe 


Met 


Ser Ala 


Tyr 


Ser 








780 








Leu 


Leu 


Val 


Thr 


Val Leu 


Asn 


Leu 






795 








800 


Gly 


Pro 


Lys 


Cys 


Tyr Met 


He 


Leu 




810 








815 




Ala 


Tyr 


Phe 


Asn 


Ser Met 


He 


Gin 


825 








830 







<210> 7 
<211> 852 
<212> PRT 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence; note » 
synthetic construct 

<400> 7 

Met Leu Gly Pro Ala Val Leu Gly Leu Ser Leu Trp Ala Leu Leu His 
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1 5 10 15 

Pro Gly Thr Gly Ala Pro Leu Cys Leu Ser Gin Gin Leu Arg Met Lys 

20 25 30 

Gly Asp Tyr Val Leu Gly Gly Leu Phe Pro Leu. Gly Glu Ala Glu Glu 

35 40 45 

Ala Gly Leu Arg Ser Arg Thr Arg Pro Ser Ser Pro Val Cys Thr Arg 

50 55 60 

Phe Ser Ser Asn Gly Leu Leu Trp Ala Leu Ala Met Lys Met Ala Val 
65 70 75 80 

Glu Glu He Asn Asn Lys Ser Asp Leu Leu Pro Gly Leu Arg Leu Gly 

85 90 95 

Tyr Asp Leu Phe Asp Thr Cys Ser Glu Pro Val Val Ala Met Lys Pro 

100 105 110 

Ser Leu Met Phe Leu Ala Lys Ala Gly Ser Arg Asp He Ala Ala Tyr 

115 120 125 

Cys Asn Tyr Thr Gin Tyr Gin Pro Arg Val Leu Ala Val He Gly Pro 

130 135 140 

His Ser Ser Glu Leu Ala Met Val Thr Gly Lys Phe Phe Ser Phe Phe 
145 150 155 160 

Leu Met Pro Gin Val Ser Tyr Gly Ala Ser Met Glu Leu Leu Ser Ala 

165 170 175 

Arg Glu Thr Phe Pro Ser Phe Phe Arg Thr Val Pro Ser Asp Arg Val 

180 185 190 

Gin Leu Thr Ala Ala Ala Glu Leu Leu Gin Glu Phe Gly Trp Asn Trp 

195 200 205 

Val Ala Ala Leu Gly Ser Asp Asp Glu Tyr Gly Arg Gin Gly Leu Ser 

210 215 220 

He Phe Ser Ala Leu Ala Ala Ala Arg Gly He Cys He Ala His Glu 
225 230 235 . 240 

Gly Leu Val Pro Leu Pro Arg Ala Asp Asp Ser Arg Leu Gly Lys Val 

245 250 255 

Gin Asp Val Leu His Gin Val Asn Gin Ser Ser Val Gin Val Val Leu 

260 265 270 

Leu Phe Ala Ser Val His Ala Ala His Ala Leu Phe Asn Tyr Ser He 

275 280 285 

Ser Ser Arg Leu Ser Pro Lys Val Trp Val Ala Ser Glu Ala Trp Leu 

290 295 300 

Thr Ser Asp Leu Val Met Gly Leu Pro Gly Met Ala Gin Met Gly Thr 
305 310 315 320 

Val Leu Gly Phe Leu Gin Arg Gly Ala Gin Leu His Glu Phe Pro Gin 

325 330 335 

Tyr Val Lys Thr His Leu Ala Leu Ala Thr Asp Pro Ala Phe Cys Ser 

340 345 350 

Ala Leu Gly Glu Arg Glu Gin Gly Leu Glu Glu Asp Val Val Gly Gin 

355 360 365 

Arg Cys Pro Gin Cys Asp Cys He Thr Leu Gin Asn Val Ser Ala Gly 

370 375 380 

Leu Asn His His Gin Thr Phe Ser Val Tyr Ala Ala Val Tyr Ser Val 
385 390 395 400 

Ala Gin Ala Leu His Asn Thr Leu Gin Cys Asn Ala Ser Gly Cys Pro 

405 410 415 

Ala Gin Asp Pro Val Lys Pro Trp Gin. Leu Leu Glu Asn Met Tyr Asn 

420 425 430 

Leu Thr Phe His Val Gly Gly Leu Pro Leu Arg Phe Asp Ser Ser Gly 

435 440 445 

Asn Val Asp Met Glu Tyr Asp Leu Lys Leu Trp Val Trp Gin Gly Ser 
450 455 460 
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VaJ. IrJuO 


Arg 


TiOii 

JJv? LL 


XlXB 




Glv 










470 






Leu 


Lys 




rtxy lip 


XIX D 








^ O 3 








Cys 


Ser 


Arg 


Gin Cys 


V7in 






e A A 

3UU 










U4 a 


Ser 


Cys 


uys iyr 


A en 
ASp 




CI c 
3 19 








3iS U 




win 


Asn 


Pro 






fi 








C 

3 J 3 






Ser 


Pro 


olU 


Arg Ser 




9 V w 
















Trp 


V7iy 


oiu Jrzro 










303 






Ser Leu 


TV 1 -» 

Aia 


Leu 


t?iy 


T ^11 "tZn 1 

Lieu vai 


Leu 


A 




C Q A 
3 O U 








His Arcf 


Asp 


Ser 


Pro 


T All X7n 1 

lieu vai 


(jin 














Phe Gly 


Leu 


val 


Cys 


Leu Gly 


Leu 


CI n 

D J.U 








013 




Pro Gly 


Gin 


Pro 


Ser 


Pro Ala 


9V<**<iw 

Arg 














Hi s Leu 


Fro 


Leu 


inr 


vaiy (jys 


Leu 








A4 
043 






ulu lie 


pne 


vai 


QalU 


ber ValU 


Leu 






O D u 








Ser Gly 


Cys 


Leu 


Arg 


Gly Pro 


Trp 




d7d 








A 


Met Leu 


Val 


Glu 


Val 


Ala Leu 


Cys 


f O A 








D273 




Pro Glu 


Val 


Val 


Tnr 


Asp Trp 


His 


AC 








/ Iv 




His Cys 




Tnr 


Arg 


Ser Trp 


val 








/ <63 






Asn. Ala. 


inr 


Leu 


Aia 


xrne xicu 


Cys 






/ ffeU 








Ser Gin 


Pro 


laiy 


Arg 


lyi Asn 


Arg 




T C C 










Leu Ala 


Tyr 


Pne 


lie 


■ 

Tnr Trp 


vai 


•7 T A 








i /3 




Val Gin 


Val 


Val 


Leu 


Arg Pro 


TV 1 s 

Ala 










790 




Cys Val 


Leu 


Gly 


He 


Leu Ala 


Ala 








805 






Leu Met 


Arg 


Gin 


Pro 


Gly Leu 


Asn 






820 








Gly Pro 


Gly 


Asp 


Ala 


Gin Gly 


Gin 




835 








840 


Gly Lys 


His 


Glu 









850 



<210> 8 
<211> 2526 
<212> DNA 

<213> Artificial Sequence 
<220> 



Ara 


Phe 


Asn 


Gly Ser 


Leu 


Arcr 


Thr 






475 








480 


Thr 


Ser 


Asp 


Asn Gin 


Lvs 


Pro 


Val 




490 








495 




Glu 


Glv 


Gin 


V eix AX 


AT*a 
Axy 


Val 


Lys 


9 v9 








51 0 

S X V 








Val 

V CIX 


Ann 


VojrB V9XU 


Ala 
Axa 


Glv 
uxy 


OCX 
















Ala 


Cys 


Thy 




Glv 


Gin 


AkSp 








540 










Cys 


Phe 


*4xy AX^ 


Acg 


Ser 


Axy 






555 








560 


Val 


Leu 


Leu 


Tjou Tieil 


Leu 


Leu 


Leu 




570 








575 




ZV1 7^ 


i*iXCl 




wxy XICU 


Php 

IT lie 


Val 

V CIX 


His 

XIX o 
















nXd 


OCX 


wXjr 


Vjx jf irX U 


XICU 


Al a 
Axa 










605 








VelX 


cys 


•Leu 


Cat- Va1 
OCX VaX 


Leu 


Leu 


irllc 
















Cys 


Leu 




G1 n G1 n 


xrx u 


XICU 


OCX 






D 9 








640 


OCX 


Th-r 


XICU 


JCrUC JJC U 


Gin 


Ala 


Ala 




OS l# 








655 

O 9 9 




jErXU 


JJCU 


OCX 


XX^ nXA 




Axg 


XICU 


665 








670 






nXO. 


Trp 


T 

Leu 


VaX Vctx 


XICU 


XJCU 


Ala 








Q O 3 








mr 


Trp 


Tyr 


xicU Val 


al a 
Aia 


PVia 
irXlc 


Pro 


















Leu 


Pro 


xhx i7xu 


2il a 


Leu 


Val 
Vax 






/ x3 








790 


OCX 


JrliC 


V7Xjr 


TiAii &1 a 
AJCU nxci 


His 

XXX o 


Al a 

AXd 


ThT 




720 








735 




xriic: 


xicu 


Glv 


XAJLX XrlXC 


Leu 


V CiX 


Al^fT 


745 








750 






Ala 


Axg 


Gly 


XJCU XIAX 


Phe 


Ala 

AXd 


i'lC^ 








765 








OCX 


XrilC 


V ax 


IrX (J . XICU 


ij6U 


al a 


Asn 








780 








Val 


Gin 


Met 


Gly Ala 


Leu 


Leu 


Leu 






795 








800 


Phe 


His 


Leu 


Pro Arg 


Cys 


Tyr 


Leu 




810 








815 




Thr 


Pro 


Glu 


Phe Phe 


Leu 


Gly 


Gly 


825 








830 






Asn 


Asp 


Gly 


Asn Thr 


Gly 


Asn 


Gin 



845 
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<223> Description of Artificial Sequence; note = 
synthetic construct 

<400> 8 

atgctgctct gcacggctcg cctggtcggc ctgcagcttc tcatttcctg ctgctgggcc 60 

tttgcctgcc atagcacgga gtcttctcct gacttcaccc tccccggaga ttacctcctg 120 

gcaggcctgt tccctctcca ttctggctgt ctgcaggtga ggcacagacc cgaggtgacc 180 
ctgtgtgaca ggtcttgtag cttcaatgag catggctacc acctcttcca ggctatgcgg . 240 

cttggggttg aggagataaa caactccacg gccctgctgc ccaacatcac cctggggtac 300 

cagctgtatg atgtgtgttc tgactctgcc aatgtgtatg ccacgctgag agtgctctcc 3 60 

ctgccagggc aacaccacat agagctccaa ggagaccttc tccactattc ccctacggtg 420 

ctggcagtga ttgggcctga cagcaccaac cgtgctgcca ccacagccgc cctgctgagc 480 

cctttcctgg tgcccatgat tagctatgcg gccagcagcg agacgctcag cgtgaagcgg 540 

cagtatccct ctttcctgcg caccatcccc aatgacaagt accaggtgga gaccatggtg 600 

ctgctgctgc agaagttcgg gtggacctgg atctctctgg ttggcagcag tgacgactat 660 

gggcagctag gggtgcaggc actggagaac caggccactg gtcaggggat ctgcattgct 720 

ttcaaggaca tcatgccctt ctctgcccag gtgggcgatg agaggatgca gtgcctcatg 780 

cgccacctgg cccaggccgg ggccaccgtc gtggttgttt tttccagccg gcagttggcc 840 

agggtgtttt tcgagtccgt ggtgctgacc aacctgactg gcaaggtgtg ggtcgcctca 900 

gaagcctggg ccctctccag gcacatcact ggggtgcccg ggatccagcg cattgggatg 960 

gtgctgggcg tggccatcca gaagagggct gtccctggcc tgaaggcgtt tgaagaagcc 1020 

tatgcccggg cagacaagaa ggcccctagg ccttgccaca agggctcctg gtgcagcagc 1080 

aatcagctct gcagagaatg ccaagctttc atggcacaca cgatgcccaa gctcaaagcc 1140 

ttctccatga gttctgccta caacgcatac cgggctgtgt atgcggtggc ccatggcctc 1200 

caccagctcc tgggctgtgc ctctggagct tgttccaggg gccgagtcta cccctggcag 1260 

cttttggagc agatccacaa ggtgcatttc cttctacaca aggacactgt ggcgtttaat 1320 

gacaacagag atcccctcag tagctataac ataattgcct gggactggaa tggacccaag 1380 

tggaccttca cggtcctcgg ttcctccaca tggtctccag ttcagctaaa cataaatgag 1440 

accaaaatcc agtggcacgg aaaggacaac caggtgccta agtctgtgtg ttccagcgac 1500 

tgtcttgaag ggcaccagcg agtggttacg ggtttccatc actgctgctt tgagtgtgtg 1560 

ccctgtgggg ctgggacctt cctcaacaag agtgacctct acagatgcca gccttgtggg 1620 

aaagaagagt gggcacctga gggaagccag acctgcttcc cgcgcactgt ggtgtttttg 1680 

gctttgcgtg agcacacctc ttgggtgctg ctggcagcta acacgctgct gctgctgctg 1740 

ctgcttggga ctgctggcct gtttgcctgg cacctagaca cccctgtggt gaggtcagca 1800 

gggggccgcc tgtgctttct tatgctgggc tccctggcag caggtagtgg cagcctctat 1860 

ggcttctttg gggaacccac aaggcctgcg tgcttgctac gccaggccct ctttgccctt 1920 

ggtttcacca tcttcctgtc ctgcctgaca gttcgctcat tccaactaat catcatcttc 1980 

aagttttcca ccaaggtacc tacattctac cacgcctggg tccaaaacca cggtgctggc 2040 

ctgtttgtga tgatcagctc agcggcccag ctgcttatct gtctaacttg gctggtggtg 2100 

tggaccccac tgcctgctag ggaataccag cgcttccccc atctggtgat gcttgagtgc 2160 

acagagacca actccctggg cttcatactg gccttcctct acaatggcct cctctccatc 2220 

agtgcctttg cctgcagcta cctgggtaag gacttgccag agaactacaa cgaggccaaa 2280 

tgtgtcacct tcagcctgct cttcaacttc gtgtcctgga tcgccttctt caccacggcc 2340 

agcgtctacg acggcaagta cctgcctgcg gccaacatga tggctgggct gagcagcctg 2400 

agcagcggct tcggtgggta ttttctgcct aagtgctacg tgatcctctg ccgcccagac 2460 

ctcaacagca cagagcactt ccaggcctcc attcaggact acacgaggcg ctgcggctcc 2520 

acctga 2526 

<210> 9 . 
<211> 2559 
<212> DNA 

<213> Artificial Sec^ience 
<220> 

<223> Description of Artificial Sequence; note « 
synthetic construct 
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<400> 9 

atgctgggcc ctgctgtcct gggcctcagc ctctgggctc tcctgcaccc tgggacgggg 60 

gccccattgt gcctgtcaca gcaacttagg atgaaggggg actacgtgct gggggggctg . 120 

ttccccctgg gcgaggccga ggaggctggc ctccgcagcc ggacacggcc cagcagccct 180 

gtgtgcacca ggttctcctc aaacggcctg ctctgggcac tggccatgaa aatggccgtg 240 

gaggagatca acaacaagtc ggatctgctg cccgggctgc gcctgggcta cgacctcttfc 300 

gatacgtgct cggagcctgt ggtggccatg aagcccagcc tcatgttcct ggccaaggca 360 

ggcagccgcg acatcgccgc ctactgcaac tacacgcagt accagccccg tgtgctggct 420 

gtcatcgggc cccactcgtc agagctcgcc atggtcaccg gcaagttctt cagcttcttc 480 

ctcatgcccc aggtcagcta cggtgctagc atggagctgc tgagcgcccg ggagaccttc 540 

ccctccttct tccgcaccgt gcccagcgac cgtgtgcagc tgacggccgc cgcggagctg 600 

ctgcaggagt tcggctggaa ctgggtggcc gccctgggca gcgacgacga gtacggccgg 660 

cagggcctga gcatcttctc ggccctggcc gcggcacgcg gcatctgcat cgcgcacgag 720 

ggcctggtgc cgctgccccg tgccgatgac tcgcggctgg ggaaggtgca ggacgtcctg 780 

caccaggtga accagagcag cgtgcaggtg gtgctgctgt tcgcctccgt gcacgccgcc 840 

cacgccctct tcaactacag catcagcagc aggctctcgc ccaaggtgtg ggtggccagc 900 

gaggcctggc tgacctctga cctggtcatg gggctgcccg gcatggccca gatgggcacg 960 

gtgcttggct tcctccagag gggtgcccag ctgcacgagt tcccccagta cgtgaagacg 1020 

cacctggccc tggccaccga cccggccttc tgctctgccc tgggcgagag ggagcagggt 1080 

ctggaggagg acgtggtggg ccagcgctgc ccgcagtgtg actgcatcac gctgcagaac 1140 

gtgagcgcag ggctaaatca ccaccagacg ttctctgtct acgcagctgt gtatagcgtg 1200 

gcccaggccc tgcacaacac tcttcagtgc aacgcctcag gctgccccgc gcaggacccc 1260 

gtgaagccct ggcagctcct ggagaacatg tacaacctga ccttccacgt gggcgggctg 1320 

ccgctgcggt tcgacagcag cggaaacgtg gacatggagt acgacctgaa gctgtgggtg 1380 

tggcagggct cagtgcccag gctccacgac gtgggcaggt tcaacggcag cctcaggaca 1440 

gagcgcctga agatccgctg gcacacgtct gacaaccaga agcccgtgtc ccggtgctcg 1500 

cggcagtgcc aggagggcca ggtgcgccgg gtcaaggggt tccactcctg ctgctacgac 1560 

tgtgtggact gcgaggcggg cagctaccgg caaaacccag acgacatcgc ctgcaccttt 1620 

tgtggccagg atgagtggtc cccggagcga agcacacgct gcttccgccg caggtctcgg 1680 

ttcctggcat ggggcgagcc ggctgtgctg ctgctgctcc tgctgctgag cctggcgctg 1740 

ggccttgtgc tggctgcttt ggggctgttc gttcaccatc gggacagccc actggttcag 1800 

gcctcggggg ggcccctggc ctgctttggc ctggtgtgcc tgggcctggt ctgcctcagc 1860 

gtcctcctgt tccctggcca gcccagccct gcccgatgcc tggcccagca gcccttgtcc 1920 

cacctcccgc tcacgggctg cctgagcaca ctcttcctgc aggcggccga gatcttcgtg 1980 

gagtcagaac tgcctctgag ctgggcagac cggctgagtg gctgcctgcg ggggccctgg 204 0. 

gcctggctgg tggtgctgct ggccatgctg gtggaggtcg cactgtgcac ctggtacctg 2100 

gtggccttcc cgccggaggt ggtgacggac tggcacatgc tgcccacgga ggcgctggtg 2160 

cactgccgca cacgctcctg ggtcagcttc ggcctagcgc acgccaccaa tgccacgctg 2220 

gcctttctct gcttcctggg cactttcctg gtgcggagcc agccgggctg ctacaaccgt 2280 

gcccgtggcc tcacctttgc catgctggcc tacttcatca cctgggtctc ctttgtgccc 2340 

ctcctggcca atgtgcaggt ggtcctcagg cccgccgtgc agatgggcgc cctcctgctc 2400 

tgtgtcctgg gcatcctggc tgccttccac ctgcccaggt gttacctgct catgcggcag 2460 

ccagggctca acacccccga gttcttcctg ggagggggcc ctggggatgc ccaaggccag 2520 

aatgacggga acacaggaaa tcaggggaaa catgagtga 2559 

<210> 10 
<211> 2518 
<212> DNA 

<213> Artificial Sec[uence 
<220> 

<223> DeBcription of Artificial Sequence; note a 
synthetic construct 

<400> 10 . 

atggggccca gggcaaagac catctgctcc ctgttcttcc tcctatgggt cctggctgag 60 

ccggctgaga actcggactt ctacctgcct ggggattacc tcctgggtgg cctcttctcc 120 
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ctccatgcca acatgaaggg cattgttcac cttaacttcc tgcaggtgcc catgtgcaag 180 

gagtatgaag tgaaggtgat aggctacaac ctcatgcagg ccatgcgctt cgcggtggag 240 

gagatcaaca atgacagcag cctgctgcct ggtgtgctgc tgggctatga gatcgtggat 300 

gtgtgctaca tctccaacaa tgtccagccg gtgctctact tcctggcaca cgaggacaac 3 60 

ctccttccca tccaagagga ctacagtaac tacatttccc gtgtggtggc tgtcattggc 420 

cctgacaact ccgagtctgt catgactgtg gccaattcct ctccctattt ctccttccac 480 

agatcaccta cagcgccatc agcgatgagc tgcgagacaa ggtgcgcttc ccggctttgc 540 

tgcgtaccac acccagcgcc gaccaccacg tcgaggccat ggtgcagctg atgctgcact 600 

tccgctggaa ctggatcatt gtgctggtga gcagcgacac ctatggccgc gacaatggca 660 

gctgcttggc gagcgcgtgg cccggcgcga catctgcatc gccttccagg agacgctgcc 720 

cacactgcag cccaaccaga acatgacgtc agaggagcgc cagcgcctgg tgaccattgt 780 

ggacaagctg cagcagagca cagcgcgcgt cgtggtcgtg ttctcgcccg acctgaccct 840 

gtaccacttc ttcaatgagg tgctgcgcca gaacttcacg ggcgccgtgt ggatcgcctc 900 

cgagtcctgg gccatcgacc cggtcctgca caacctcacg gagctgggcc acttgggcac 960 

cttcctgggc atcaccatcc agagcgtgcc catcccgggc ttcagtgagt tccgcgagtg 1020 

gggcccacag gctgggccgc cacccctcag caggaccagc cagagctata cctgcaacca 1080 

ggagtgcgac aactgcctga acgccacctt gtccttcaac accattctca ggctctctgg 1140 

ggagcgtgtc gtctacagcg tgtactctgc ggtctatgct gtggcccatg ccctgcacag 1200 

cctcctcggc tgtgacaaaa gcacctgcac caagagggtg gtctacccct ggcagctgct 1260 

tgaggagatc tggaaggtca acttcactct cctggaccac caaatcttct tcgacccgca 1320 

aggggacgtg gctctgcact tggagattgt ccagtggcaa tgggaccgga gccagaatcc 1380 

cttccagagc gtcgcctcct actaccccct gcagcgacag ctgaagaaca tccaagacat 1440 

ctcctggcac accgtcaaca acacgatccc tatgtccatg tgttccaaga ggtgccagtc 1500 

agggcaaaag aagaagcctg tgggcatcca cgtctgctgc ttcgagtgca tcgactgcct 1560 

tcccggcacc ttcctcaacc acactgaaga tgaatatgaa tgccaggcct gcccgaataa 1620 

cgagtggtcc taccagagtg agacctcctg cttcaagcgg cagctggtct tcctggaatg 1680 

gcatgaggca cccaccatcg ctgtggccct gctggccgcc ctgggcttcc tcagcaccct 1740 

ggccatcctg gtgatattct ggaggcactt ccagacaccc atagttcgct cggctggggg 1800 

ccccatgtgc ttcctgatgc tgacactgct gctggtggca tacatggtgg tcccggtgta 1860 

cgtggggccg cccaaggtct ccacctgcct ctgccgccag gccctctttc ccctctgctt 1920 

cacaatttgc atctcctgta tcgccgtgcg ttctttccag atcgtctgcg ccttcaagat 1980 

ggccagccgc ttcccacgcg cctacagcta ctgggtccgc taccaggggc cctacgtctc 2040 

tatggcattt atcacggtac tcaaaatggt cattgtggta attggcatgc tggccacggg 2100 

cctcagtccc accacccgta ctgaccccga tgaccccaag atcacaattg tctcctgtaa 2160 

ccccaactac cgcaacagcc tgctgttcaa caccagcctg gacctgctgc tctcagtggt 2220 

gggtttcagc ttcgcctaca tgggcaaaga gctgcccacc aactacaacg aggccaagtt 2280 

catcaccctc agcatgacct tctatttcac ctcatccgtc tccctctgca ccttcatgtc 2340 

tgcctacagc ggggtgctgg tcaccatcgt ggacctcttg gtcactgtgc tcaacctcct 2400 

ggccatcagc ctgggctact tcggccccaa gtgctacatg atcctcttct acccggagcg 2460 

caacacgccc gcctacttca acagcatgat ccagggctac accatgagga gggactag 2518 

<210> 11 
<211> 2577 
<212> DNA 

<213> Artificial Secpience 
<220> 

<223> Description of Artificial Sequence; note s 
synthetic construct 

<400> 11 . 

atgccgggtt tggctatctt gggcctcagt ctggctgctt tcctggagct tgggatgggg 60 

tcctctttgt gtctgtcaca gcaattcaag gcacaagggg actatatatt gggtggacta 120 

tttcccctgg gcacaactga ggaggccact ctcaaccaga gaacacagcc caacggcatc 180 
ctatgtacca ggttctcgcc ccttggtttg ttcctggcca tggctatgaa gatggctgta . 240 

gaggagatca acaatggatc tgccttgctc cctgggctgc gactgggcta tgacctgttt 300 

gacacatgct cagagccagt ggtcaccatg aagcccagcc tcatgttcat ';ggccaaggtg 360 
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ggaagtcaaa gcattgctgc ctactgcaac tacacacagt accaaccccg tgtgctggct 420 

gtcattggtc cccactcatc agagcttgcc ctcattacag gcaagttctt cagcttcttc 480 

ctcatgccac aggtcagcta tagtgccagc atggatcggc taagtgaccg ggaaacattt 540 

ccatccttct tccgcacagt gcccagtgac cgggtgcagc tgcaggccgt tgtgacactg 600 

ttgcagaatt tcagctggaa ctgggtggct gccttaggta gtgatgatga ctatggccgg 660 

gaaggtctga gcatcttttc tggtctggcc aactcacgag gtatctgcat tgcacacgag 720 

ggcctggtgc cacaacatga cactagtggc caacaattgg gcaaggtggt ggatgtgcta 780 

cgccaagtga accaaagcaa agtacaggtg gtggtgctgt ttgcatctgc ccgtgctgtc 840 

tactcccttt ttagctacag catccttcat gacctctcac ccaaggtatg ggtggccagt 900 

gagtcctggc tgacctctga cctggtcatg acacttccca atattgcccg tgtgggcact 960 

gttcttgggt ttctgcagcg cggtgcccta ctgcctgaat tttcccatta tgtggagact 1020 

cgccttgccc tagctgctga cccaacattc tgtgcctccc tgaaagctga gttggatctg 1080 

gaggagcgcg tgatggggcc acgctgttca caatgtgact acatcatgct acagaacctg 1140 

tcatctgggc tgatgcagaa cc tat cage t gggcagttgc accaccaaat atttgcaacc 1200 

tatgcagctg tgtacagtgt ggctcaggcc cttcacaaca ccctgcagtg caatgtctca 1260 

cattgccaca catcagagcc tgttcaaccc tggcagctcc tggagaacat gtacaabatg 1320 

agtttccgtg ctcgagactt gacactgcag tttgatgcca aagggagtgt agacatggaa 1380 

tatgacctga agatgtgggt gtggcagagc cctacacctg tactacatac tgtaggcacc 1440 

ttcaacggca cccttcagct gcagcactcg aaaatgtatt ggccaggcaa ccaggtgcca 1500 

gtctcccagt gctcccggca gtgcaaagat ggccaggtgc gcagagtaaa gggctttcat 1560 

tcctgctgct atgactgtgt ggactgcaag gcagggagct accggaagca tccagatgac 1620 

ttcacctgta ctccatgtgg caaggatcag. tggtccccag aaaaaagcac aacctgctta 1680 

cctcgcaggc ccaagtttct ggcttggggg gagccagctg tgctgtcact tctcctgctg 1740 

ctttgcctgg tgctgggcct gacactggct gccctggggc tctttgtcca ctactgggac 1800 

agccctcttg ttcaggcctc aggtgggtca ctgttctgct ttggcctgat ctgcctaggc 1860 

ctcttctgcc tcagtgtcct tctgttccca ggacgaccac gctctgccag ctgccttgcc 1920 

caacaaccaa tggctcacct ccctctcaca ggctgcctga gcacactctt cctgcaagca 1980 

gccgagatct ttgtggagtc tgagctgcca ctgagttggg caaactggct ctgcagctac 2040 

cttcggggcc cctgggcttg gctggtggta ctgctggcca ctcttgtgga ggctgcacta 2100 

tgtgcctggt acttgatggc tttccctcca gaggtggtga cagattggca ggtgctgccc 2160 

acggaggtac tggaacactg ccgcatgcgt tcctgggtca gcctgggctt ggtgcacatc 2220 

accaatgcag tgttagcttt cctctgcttt ctgggcactt tcctggtaca gagccagcct 2280 

ggtcgctata accgtgcccg tggcctcacc ttcgccatgc tagcttattt catcatctgg 2340 

gtctcttttg tgcccctcct ggctaatgtg caggtggcct accagccagc tgtgcagatg 2400 

ggtgctatct tattctgtgc cctgggcatc ctggccacct tccacctgcc caaatgctat 2460 

gtacttctgt ggctgccaga gctcaacacc caggagttct tcctgggaag gagccccaag 2520 

gaagcatcag atgggaatag tggtagtagt gaggcaactc ggggacacag tgaatga 2577 

<210> 12 ^ 
<211> 137 
<212> PRT 

<213> Artificial Sequence 
<220> 



<223> Description of Artificial Sequence; note « 



synthetic 


construct 










<400> 12 












Pro Ser Pro Phe 


Arg Asp He 


Val 


Ser 


Tyr Pro 


Asp Lys He He Leu 


1 


5 






10 


15 


Gly Cys Phe Met 


Asn Leu Lys 


Thr 


Ser 


Ser Val 


Ser Phe Val Leu Leu 


20 






25 




30 


Leu Leu Leu Cys 


Leu Leu Cys 


Phe 


He 


Phe Ser 


Tyr Met Gly Lys Asp 


35 




40 






45 


Leu Pro Lys Asn 


Tyr Asn Glu 


Ala 


Lys 


Ala He 


Thr Phe Cys Leu Leu 


50 


55 








60 


Leu Leu He Leu 


Thr Trp He 


lie 


Phe 


Thr Thr 


Ala Ser Leu Leu Tyr 
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65 


70 










75 












Gin Gly Lys Tyr He His 


Ser 


Leu 


Asn Ala 


Leu Ala Val 


Leu 


Ser 


Ser 




85 








90 










95 




lie Tyr Ser Phe Leu Leu Trp 


Tyr 


Phe Leu Pro Lys Cys 


Tyr 


He 


He 




100 






105 










110 






He Phe Gin 


Pro Gin Lys Asn 


Thr 


Gin Ijys Tyr Phe Gin 


Gly 


Leu 


He 


115 






120 










125 








Gin Asp Tyr Thr Lys Thr 


He 


Ser 


Gin 
















130 




135 




















<210> 13 
























<211> 242 
























<212> PRT 
























<213> Artificial Sequence 




















<220> 
























<223> Description of Artificial 


Sequence; note o 








synthetic construct 




















<40U> 13 
























Fne Aia vajL 


Asn ryr Asn 


inr 


Pro 


Val 


Val 


Arg 


Ser 


Ala 


Gly 


Gly 


Pro 


1 










10 










15 




Met Cvs Phe 


Leu Xle Leu 


Glv 


Cys 


Leu 


Ser 


Leu 


Cys 


Ser 


He 


Ser 


Val 




20 






25 










30 






Phe Phe Tyr 


Phe Glu Arg 


Pro 


Thr 


Glu 


Ala 


Phe 


Cys 


He 


Leu 


Arg 


Phe 


35 






40 










45 








Met Pro Phe 


Leu Leu Phe 


Tyr 


Ala 


Val 


Cys 


Leu 


Ala 


Cys 


Phe 


Ala 


Val 


50 




55 










60 










Arg Ser Phe 


Gin He Val 


He 


He 


Phe 


Lys 


He 


Ala 


Ala 


Lys 


Phe 


Pro 


65 


70 










75 










80 


Arg Val His 


Ser Trp Trp 


Met 


Lys 


Tyr 


His 


Gly 


Gin 


Tarp 


Leu 


Val 


He 




85 








90 










95 




Ser Met Thr 


Phe Val Leu 


Gin 


Ala 


Val 


Val 


He 


Val 


He 


Gly 


Phe 


Ser 




100 






105 










110 






Ser Asn Pro 


Pro Leu Pro 


Tyr 


Xaa 


Xaa 


Phe 


Val 


Ser 


Tyr 


Pro 


Asp 


Lys 


115 






120 










125 








He He Leu 


Gly Cys Asp 


Val 


Asn 


Leu 


Asn 


Met 


Ala 


Ser 


Thr 


Ser 


Phe 


130 




135 










140 










Phe Leu Leu 


Leu Leu Leu 


Cys 


He 


Leu 


Cys 


Phe 


Thr 


Phe 


Ser 


Tyr 


Met 


145 


150 










155 










160 


Gly Lys Asp 


Leu Pro Lys 


Asn 


Tyr 


Asn 


Glu 


Ala 


Lys 


Ala 


lie 


Thr 


Phe 




165 








170 










175 




Cys Leu Leu 


Leu Leu He 


Leu 


Thr 


Trp 


He 


He 


Phe 


Ala 


Thr 


Ala 


Phe 


180 






185 










190 






Met Leu Tyr 


His Gly Lys 


Tyr 


He 


His 


Thr 


Leu 


Asn 


Ala 


Leu 


Ala 


Val 


195 






200 










205 








Leu Ser Ser 


Ala Tyr Cys 


Phe 


Leu 


Leu 


Trp 


Tyr 


Phe 


Leu 


Pro 


Lys 


Cys 


210 




215 










220 










Tyr He He 


He Phe Gin 


Pro 


His 


Lys 


Asn 


Thr 


Gin 


Lys 


Tyr 


Phe 


Gin 



225 230 235 240 

Leu Ser 



<210> 14 
<211> 165 
<212> PRT 
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<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence; note « 
synthetic construct 



<400> 14 



Iivs Lys 


Gin 


Gly Pro 


Glu 


Val Asp 


He Phe He 


Val Ser Val 


Thr He 


1 






5 










10 






15 


Leu Cys 


lie 


Ser 


Val 


Leu 


Gly Vai 


Ala Val Gly 


Pro Pro Glu 


Pro Ser 






20 










25 






30 




Gin Asp 


Leu 


Asp 


Phe 


Tyr 


Met Asp 


Ser 


He 


Val 


Leu Glu Cys 


Ser Asn 




35 










40 








45 




Thr Leu 


Ser 


Pro Gly 


Ser 


Phe 


He 


Glu Leu Cys 


Tyr Val Cys 


Val Leu 


50 










55 










60 




Ser val 


Leu 


Cys 


Phe 


Phe 


Phe 


Ser 


Tyr Met Gly 


Lys Asp Leu 


Pro Ala 


65 








70 










75 




80 


Asn Tyr 


Asn 


Glu Ala 


Lvs 


Cys 


Val 


Thr 


Phe 


Ser 


Leu Met Val 


Tyr Met 








85 










90 






95 


lie Ser 


Trp 


He 


Ser 


Phe 


Phe 


Thr 


Val Tyr Leu 


He Ser Arg 


Gly Pro 






100 










105 






110 




Phe Thr 


Val 


Ala 


Ala 


Tyr 


Val 


Cys 


Ala 


Thr 


Leu 


Val Ser Val 






115 










120 








125 




Phe Phe 


Gly 


Gly Tyr 


Phe 


Leu 


Pro 


Lys 


He Tyr 


He He Val 


Leu Lys 


130 








135 










140 




Pro Gin 


Met 


Asn 


Thr 


Thr 


Ala His 


Phe 


Gin Asn 


Cys He Gin 


Met Tyr 


145 








150 










155 




160 


Thr Met 


Ser 


Lys Gin 
























165 


















<210> 15 






















<211> 236 






















<212> PRT 






















<213> Artificial Sequence 














<220> . 
























<223> Description of Artificial 


Sequence; note s * 




synthetic 


construct 














<400> 15 






















Ala Pro 


Lys 


Ser 


Ser 


Gin 


Arg 


Xaa 


Leu 


Arg 


Arg 


Thr Arg Leu 


Xaa Ijeu 


1 






5 










10 






15 


Glu Trp 


Asp 


His 


Pro 


Met 


Ser 


Val 


Ala 


Leu 


Leu 


Phe Phe Leu 


Val Cys 






20 










25 






30 




Cys Leu 


Leu 


Met 


Thr 


Ser 


Ser 


Ser 


Ala 


Val 


He 


Leu Leu Leu 


Asn He 




35 










40 








45 




Asn Thr 


Pro 


Val 


Ala 


Lys 


Ser 


Ala 


Gly 


Gly 


Xaa 


Thr Cys Xaa 


Leu Lys 


50 










55 










60 




Leu Ala 


Ala 


Leu 


Thr 


Ala 


Ala 


Ala 


Met 


Ser 


Ser 


Xaa Cys His 


Phe Gly 


65 








70 










75 




80 


Gin Pro 


Ser 


Pro 


Leu 


Ala 


Ser 


Lys 


Leu 


Lys 


Gin 


Pro Gin Phe. 


Thr Phe 








85 










90 






95 


Ser Phe 


Thr 


Val 


Cys 


Leu 


Ala 


Cys 


Asn 


Arg 


Cys 


Ala Leu Ala 


Thr Gly 






100 










105 






110 




His Leu 


His 


Phe 


Xaa 


He 


Arg 


Val 


Ala 


Leu 


Pro 


Pro Ala Tyr 


Asn Xaa 




115 










120 








125 




Tirp Ala 


Lys 


Asn 


His 


Gly 


Pro 


Xaa 


Ala 


Thr 


He 


Phe He Ala 


Ser Ala 
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130 










135 










140 








Ala 


He 


Leu Cys Val 


Leu 


Cys Leu 


Arg Val Ala 


Val 


Gly Pro Pro 


Gin 


145 










150 










155 








160 


Pro 


Ser 


Gin 


Asx 


Leu 


Asx 


Phe 


Xaa 


Thr 


Asn 


Ser 


He 


Xaa 


Leu Xaa 


Xaa 










165 










170 








175 




Ser 


Asn 


Thr 


Leu 


Ser 


Pro 


Gly Ser 


Phe 


Val 


Glu 


Leu 


Cys Asn Val 


Ser 








180 










185 










190. . 




Leu 


Leu 


Ser Ala 


Val 


Cys 


Phe 


Val 


Phe 


Ser 


Xaa 


Met 


Gly Lys Asx 


Leu 






195 










200 










205 






Pro 


Ala 


Asn Tyr Ash 


Glu 


Ala Lys 


Cys Val Thr 


Phe 


Ser 


Leu Met 


Val 




210 










215 










22 0 








Asn 


Xaa 


He 


Ser Trp 


He 


Ser 


Phe 


Phe 


Thr 


Val 


Tyr 








225 










230 










235 










<210> 16 


























<211> 838 


























<212> PRT 


























<213> Artificial Sequence 


















<220> 




























<223> Description of Artificial 


Sequence; note s 








synthetic 


construct 


















<400> 16 


























Met 


Gly 


Pro 


Arg 


Ala 


Lys- 


Thr 


He 


Cvs 


Ser 


Leu 


Phe 


Phe 


Leu Leu 


Trp 


1 








5 










10 








15 




Val 


Leu 


Ala 


Glu 


Pro 


Ala 


Glu 


Asn 


Ser 


ASD 


Phe 


TVT 


Leu Pro Gly 


Asp 








20 










25 










30 




Tyr 


Leu 


Leu 


Gly 


Gly 


Leu 


Phe 


Ser 


Leu 


His 


Ala 


Asn 


Met Lys Gly 


He 






35 










.40 










45 






Val 


His 


Leu 


Asn 


Phe 


Leu 


Gin 


Val 


Pro 


Met 


Cys 


Lys 


Glu Tyr Glu 


Val 




50 










55 










60 








Lys 


Val 


He 


Gly 


Tyr 


Asn 


Leu 


Met 


Gin 


Ala 


Met 


Arg 


Phe 


Ala Val 


Glu 


65 










70 










75 








80 


Qlu 


He 


Asn 


Asn 


Asp 


Ser 


Ser 


Leu. 


Leu 


Pro 


Gly 


Val 


Leu Leu Gly 


Tyr 










85 










90 








95 




Glu 


He 


Val 


Asp 


Val 


Cys 


Tyr 


He 


Ser 


Asn 


Asn 


Val 


Gin 


Pro Val 


Leu 








100 










105 










110 




Tyr 


Phe 


Leu 


Ala 


His 


Glu 


Asp 


Asn 


Leu 


Leu 


Pro 


He 


Gin 


Glu Asp 


Tyr 






115 










120 










125 






Ser 


Asn 


Tyr 


He 


Ser 


Arg 


Val 


Val 


Ala 


Val 


He 


Gly 


Pro 


Asp Asn 


Ser 




130 










135 










140 








Glu 


Ser 


Val 


Met 


Thr 


Val 


Ala 


Asn 


Phe 


Leu 


Ser 


Leu 


Phe 


Leu Leu 


Pro 


145 










150 










155 








160 


Gin 


He 


Thr 


Tyr 


Ser 


Ala 


He 


Ser 


Asp 


Glu 


Leu 


Arg 


Asp 


Lys Val 


Arg 










165 










170 








175 




Phe 


Pro 


Ala 


Leu 


Leu 


Arg 


Thr 


Thr 


Pro 


Ser 


Ala 


Asp 


His 


His Val 


Glu 








180 










185 










190 




Ala 


Met 


Val 


Gin 


Leu 


Met 


Leu 


His 


Phe 


Arg 


Trp 


Asn 


Trp 


He He 


Val 






195 










200 










205 






Leu 


Val 


Ser 


Ser 


Asp 


Thr 


Tyr 


Gly 


Arg 


Asp 


Asn 


Gly 


Gin 


Leu Leu 


Gly 




210 










215 










220 








Glu 


Arg 


Val 


Ala 


Arg 


Arg 


Asp 


He 


Cys 


He 


Ala 


Phe 


Gin 


Glu Thr 


Leu 


225 










230 










235 








240 


Pro 


Thr 


Leu 


Gin 


Pro 


Asn 


Gin 


Asn 


Met 


Thr 


Ser 


Glu 


Glu 


Arg Gin 


Arg 










245 










250 








255 




Leu 


Val 


Thr 


He 


Val 


Asp 


Lys 


Leu 


Gin 


Gin 


Ser 


Thr 


Ala 


Arg Val 


Val 
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260 265 270 

Val Val Phe Ser Pro Asp Leu Thr Leu Tyr His Phe Phe Asn Glu Val 

275 280 285 

Leu Arg Gin Asn Phe Thr Gly Ala Val Trp lie Ala Ser Glu Ser Trp 

290 295 300 

Ala lie Asp Pro Val Leu His Asn Leu Thr Glu Leu Qly His Leu Gly 
305 310 315 320 

Thr Phe Leu Gly lie Thr lie Gin Ser Val Pro lie Pro Gly Phe Ser 

325 330 335 

Glu Phe Arg Glu Trp Gly Pro Gin Ala Gly Pro Pro Pro Leu Ser Arg 

340 345 ^ 350 

Thr Ser Gin Ser Tyr Thr Cys Asn Gin Glu Cys Asp Asn Cys Leu Asn 
355 360 365 



Ala Thr Leu Ser Phe Asn Thr He Leu Arg Leu Ser Gly Glu Acg Val 

370 375 380 

Val Tyr Ser Val Tyr Ser Ala Val Tyr Ala Val Ala His Ala Leu His 
385 390 395 400 

Ser Leu Leu Gly Cys Asp Lys Ser Thr Cys Thr Lys Arg Val Val Tyr 

405 410 415 

Pro Trp Gin Leu Leu Glu Glu He Trp Lys Val Asn Phe Thr Leu Leu 

420 425 430 

Asp His Gin He Phe Phe Asp Pro Gin Gly Asp Val Ala Leu His Leu 

435 440 445 

Glu He Val Gin Trp Gin Trp Asp Arg Ser Gin Asn Pro Phe Gin Ser 

450 455 460 

Val Ala iSer Tyr Tyr Pro Leu Gin Arg Gin Leu Lys Asn He Gin Asp 
465 470 475 480 

He Ser Trp His Thr Val Asn Asn Thr He Pro Met Ser Met Cys Ser 

485 490 495 

Lys Arg Cys Gin Ser Gly Gin Lys Lys Lys Pro Val Gly He His Val 

500 505 510 

Cys. Cys Phe Glu Cys He Asp Cys Leu Pro Gly Thr Phe Leu Asn His 

515 520 525 

Thr Glu Asp Glu Tyr Glu Cys Gin Ala Cys Pro Asn Asn Glu Trp Ser 

530. 535 540 

Tyr Gin Ser Glu Thr Ser Cys Phe Lys Arg Gin lieu Val Phe Leu Glu 
545 550 555 560 

His Glu Val Pro Thr He Val Val Ala He Leu Ala Ala Leu Gly Phe 

565 570 575 

Phe Ser Thr Leu Ala He Leu Phe He Phe Trp Arg His Phe Gin Thr 

580 585 590 

Pro Met Val Arg Ser Ala Gly Gly Pro Met Cys Phe Leu Met Leu Val 

595 600 605 

Pro Leu Leu Leu Ala Phe Gly Met Val Pro Val Tyr Val Gly Pro Pro 

610 615 620 

Thr Val Phe Ser Cys Phe Cys Arg Gin Ala Phe Phe Thr Val Cys Phe 
.625 630 635 640 

Ser He Cys Leu Ser Cys He Thr Val Arg Ser Phe Gin He Val Cys 

645 650 . 655 

Val Phe Lys Met Ala Arg Arg Leu Pro Ser Ala Tyr Ser Phe Trp Met 

660 665 670 

Arg Tyr His Gly Pro Tyr Val Phe Val Ala Phe He Thr Ala He Lys 

675 680 685 

Val Ala Leu Val Val Gly Asn Met Leu Ala Thr Thr He Asn Pro He 

690 695 700 

Gly Arg Thr Asp Pro Asp Asp Pro Asn He Met He Leu Ser Cys His 
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705 








710 




PlTO 


Asn 




Asn 


Glv 


Leu Leu 








725 






Xi6U 




Val 


Tjeu Glv 


Phe 


Ser Phe 








740 








Asn 




Asn Gill 


Ala 


Tjvs Phe 






755 






760 


Phe 


Thz* 


Seir 


Ser lie 


Ser 


Leu Cvs 




770 








775 


Val 


Leu 


Val 


Thr He 


Met 


Asp Leu 


785 








790 




Ala 


He 


Gly 


Leu Gly 


Tyr 


Phe Gly 








805 






Tyr 


Pro 


Glu 


Arg Asn 


Thr 


Ser Ala 








820 






Tyr 


Thr 


Met 


Arg Lys 


Ser 





835 



<210> 17 
<211> 844 

<212> PRT • 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial 
synthetic construct 



<400> 17 



Met 


Gly 


Pro 


Gin 


Ala 


Arg 


Thr 


Leu 


1 








5 








Val 


Leu 


Pro 


Lys 


Pro 


Gly 


Lys 


Leu 








20 










Ala 


Gly 


Asp 


Tyr 


Leu 


Leu 


Gly 


Gly 






35 










40 


Lys 


Ser 


He 


Ser 


His 


Leu 


Ser 


Tyr 




50 










55 




Phe 


Thr 


Met 


Lys 


Val 


Leu 


Gly 


Tyr 


65 










70 






Ala 


Val 


Glu 


Glu 


He 


Asn 


Asn 


Cys 










85 








Leu 


Gly 


Tyr 


Glu 


Met 


Val 


Asp 


Val 








100 










Pro 


Gly 


Leu 


Tyr 


Phe 


Leu. 


Ala 


Gin 






115 










120 


Lys 


Asp 


Tyr 


Ser 


Gin 


Tyr 


Net 


Pro 




130 










135 




Asp 


Asn 


Ser 


Glu 


Ser 


Ala 


He 


Thr 


145 










150 






Leu 


lie 


Pro 


Gin 


He 


Thr 


Tyr 


Ser 










165 








Lys 


Arg 


His 


Phe 


Pro 


Ser 


Met 


Leu 








180 










His 


He 


Glu 


Ala 


Met 


Val 


Gin 


Leu 






195 










200 


He 


Val 


Val 


Leu 


Val 


Ser 


Asp 


Asp 




210 










215 




Leu 


Leu 


Ser 


Gin 


Arg 


Leu 


Thr 


Lys 







715 










720 


Phe 


Asn 


Thr 


Ser 


Met 


ASD 


Leu 


Leu 




730 










735 




Ala 


Tyr 


Met 


Gly Lys 


Glu 


Leu 


Pro 


745 










750 






He 


Thr 


Leu 


Ser 


Met 


Thr 


Phe 


Ser 










765 








Thr 


Phe 


Met 


Ser Val 


His 


Asp 


Gly 








780 










Leu 


Val 


Thr 


Val 


Leu 


Asn 


Phe 


Leu 






795 










800 


Pro 


Lys 


Cys 


Tyr Met 


He 


Leu 


Phe 




810 










815 




Tyr 


Phe 


Asn 


Ser 


Met 


He 


Gin 


Gly 


825 










830 






Sequence; note » 








Cys 


Leu 


Leu 


Ser 


Leu 


Leu 


Leu 


His 




10 










15 




Val 


Glu 


Asn 


Ser 


Asp 


Phe 


His 


Leu 


25 










30 






Leu 


Phe 


Thr 


Leu 


His 


Ala 


Ann 


Val 
vox 










45 








Leu 


Gin 


Val 


Pro 


Lys 


Cys 


Asn 


Glu 








60 










Asn 


Leu 


Met 


Gin 


Ala 


Met 




Phe 






75 










80 


Ser 


Ser 


Leu 


Leu 


Pro 


Glv 


Val 


Leu 




90 










95 




Cys 


Tyr 


Leu 


Ser 


Asn 


Asn 


lie 


His 


105 










110 






Asp 


Asp 


Asp 


Leu 


Leu 


Pro 


He 


Leu 










125 








His 


Val 


Val 


Ala 


Val 


He 


Glv 


Pro 








140 










Val 


Ser 


Asn 


He 


Leu 


Ser 


His 


Phe 






155 










160 


Ala 


He 


Ser 


Asp 


Lys 


Leu 




Asp 




170 










175 




Arg 


Thr 


Val 


Pro 


Ser 


Ala 


Thr 


His 


185 










190 






Met 


Val 


His 


Phe 


Gin 


Trp 


Asn 


Trp 










205 








Asp 


Tyr 


Gly 


Arg 


Glu 


Asn 


Ser 


His 








220 










Thr 


Ser 


Asp 


He 


Cys 


He 


Ala 


Phe 



19 
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225 230 







vax 


Leu 


Pro 
245 


Tl A 

xjie 


Pro 


V7XU 


Gov* 


61u 


Gin 


Arg 


Gin 
260 


Leu 


Asp 


Asn 


He 


Leu 


Ala 


Arg 


Val 

275 


Val 


Val 


Val 


Phe 


Ser 


Pro 


Phe 


His 

O Q A 


Glu 


Val 


Leu 


Arg 


Trp 

O Q C 
Z 3 


Asn 


Phe 


Ser 


GlU 


Ser 


Trp 


Ala 


He 


Asp 


Pro 


Val 


one 










11 n 








A3r9 


XIXB 






XiiJu 
o o c 






Glv 


Val 


Pro 


Gly 


Phe 


Ser 
340 


Gin 


Phe 


Arg 


Val 


Arg 

•a A c 


Val 


Pro 


Asn 
355 


Thr 


Thr 


Asn 


Leu 


Arg 

O f A 

360 


Thr 


Ala 


Cys 

370 


Leu 


Asn 


Thr 


Thr 


Lys 

375 


Ser 


Phe 


Gly 


Qlu 


Arg 


Val 


Val 


Tyr 


Ser 


Val 


Tyr 


385 










390 








His 


Ala 


Leu 


rilS 


TV 

Arg 
405 


ijeu 


Leu 


v»iy 


Cys 


Gin 


Lys 


Val 


Tyr 
420 


Pro 


Trp 


Gin 


Leu 


Leu 

>i o e 
425 


Phe 


Thr 


Leu 
435 


Leu 


Gly 


Asn 


Arg 


Leu 

440 


Phe 


Pro 


Met 
450 


Leu 


Leu 


Asp 


He 


He 
455 


Gin 


Trp 


Pro 


Phe 


Gin 


Ser 


He 


Ala 


Ser 


Tyr 


Ser 


465 










Ann 
% /{J 








Tyr 




Asn 


Asn 


V a,X 
485 


OCX 


Tim 


xyr 




Ser 


Met 


Cys 


Ser 


Lys 


Ser 


Cys 


Gin 


Pro 

c n c: 


Gly 


Leu 


His 
515 


Pro 


Cys 


Cys 


Phe 


Glu 
520 


Cys 


Tyr 


Leu 
530 


Asn 


Arg 


Ser 


Ala 


Asp 
535 


Glu 


Phe 


Ser 


Met 


Trp 


Ser 


Tyr 


Lys 


Asn 


Asp 


He 


545 










550 








Thr 


File 


Leu 


vjXU 


Trp 
565 


Trp 


xixs 


\jXU 


AX a 


Leu 


Ala 


Ala 


Leu 
580 


Gly 


Phe 


Leu 


Ser 


Thr 
585 


Trp 


Arg 


His 
595 


Phe 


Gin 


Thr 


Pro 


He 

f A A 

600 


Val 


Cys 


Phe 
610 


Leu 


Met 


Leu 


Thr 


Leu 
615 


Leu 


Leu 


Val 


Tyr 


Val 


Gly 


Pro 


Pro 


Lys 


Val 


Ser 


625 










630 








Leu 


Phe 


Pro 


Leu 


Cys 
645 


Phe 


Thr 


He 


Cys 


Ser 


Phe 


Gin 


He 

660 


Val 


Cys 


Ala 


Phe 


Lys 
665 


Ala 


Tyr 


Ser 
675 


Tyr 


Trp 


Val 


Arg. 


Tyr 
680 


Gin 



PCT/US2004/025459 
235 240 





nl n 

UXZl 


Val 

VAX 






OCX 


Glu 


J? n 


















TtOii 






Thr 


Ser 










om 
^ f \j 






V9XU 






TiAii 

JJCU 




Ser 


Phe 
















Thr* 


m V 


XrllC 


Val 

V CIX 




He 


Ala 


















nxo 






XllX 


Glu 


Leu 














J ^ 1/ 


Th-r 




m n 

wXll 




Val 

V OIX 


uCX 


Tl 
xxe 


^•*n 
















XL fin 


TjVS 


X w 


Glv 
oxy 


lyx 


Pro 
















xnr 


Cys 


Asn 


ni n 


nop 


Cys 


Asp 
















Asn 


Asn 


xxe 


Leu 


Tl ^ 

xxe 


Leu 


Ser 






"a o A 










Ser 


i\xa 


vax 


Tyr 


al a 


vax 


al a 




o o c 










AAA 
4UU 


Asn 


J^trg 


Va 1 


Arg 




xnr 


X)y D 


A 1 A 

4XU 










A 1 C 
4X3 




Arg 


I.7XU 


Tl » 

xxe 


Trp 


AXB 


Va 1 
VaX 


'A^n 










40U 








Asp 


urXXi 


n7X1X 


kjxy 


&aiA 


i*lCi« 








A AR 










ilp 


& em 
Aop 




OCX 


m n 

V7X1JL 








A £r A 










Pro 


iiir 


Ser 


Lys 


Arg 


Leu 


XiXx 




AT c 










Ann 




no IX 


no 11 


XllX 


Val 

V CIX 


XT X 


Val 
V ax 


A QA 










A QC 
4 73 




oxy 


oxn 




T 

Lys 


Lys 


Ser 


\Ts» 1 
Vclx 










3 xu 






Leu 


Asp 


Cys 




Pro 


\jxy 


Xllx 
















noli 




TiAii 
ucu 


OCX 




irx u 


Ol V 
t»xy 
















mr 


Cys 


Jrne 


vixn 


Arg 


Arg 


Pro 




c e e 

555 










5o0 


irro 


iiixr 


Tl 
XXc 


Al a 
nXa 


Val ' 

VCiX 


ai a 


XICU 


CT A 

570 










e n c 

575 




Leu 


ft! a 
AX a 


Tl o 
XX6 


Leu 


\Tsi 1 
VolX 


Tl e 

XXc 


Irllc 










can 






Arg 


Ser 


ai a 


m vr 

vaxy 


urxy 


Pro 


ricv 








f A c 
DUD 








VCLX 


Ala 


xyx 


1*1C I* 


Val 

VAX 


Val 
V nx 


IrX U 






620 










Thr 


Cys 


Leu 


Cys 


Arg 


Gin 


Ala 




635 










640 


He 


Ser 


Cys 


He 


Ala 


Val 


Arg 


650 










655 




Met 


Ala 


Ser 


Arg 


Phe 


Pro 


Arg 










670 






Gly 


Pro 


Tyr 


Val 


Ser 


Met 


Ala 



685 
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Phe 


He 


Thr 


Val 


Leu 


Lys 


Met 


val He 


Val 


Val 


He Gly Met Leu Ala 




690 










695 








700 


Thr 


Gly 


Leu 


Ser 


Pro 


Thr 


Thr 


Arg Thr 


Asp 


Pro 


Asp Asp Pro Lys He 


705 










710 








715 


720 


Thr 


He 


Val 


Ser 


Cys 


Asn 


Pro 


Asn Tyr 


Arg 


Asn 


Ser Leu Leu Phe Asn 










725 








730 




735 


Thr 


Ser 


Leu 


Asp 


Leu 


Leu 


Leu 


Ser Val 


Val 


Gly 


Phe Ser Phe Ala Tyr 








740 








745 






750 


Met 


Gly 


Lys 


Glu 


Leu 


Pro 


Thr 


Asn Tyr 


Asn 


Glu 


Ala Lys Phe He Thr 






755 










760 






765 


Leu 


Ser 


Met 


Thr 


Phe 


Tyr 


Phe 


Thr Ser 


Ser 


Val 


Ser Leu Cys Thr Phe 




770 










775 








780 


Met 


Ser 


Ala 


Tyr 


Ser 


Gly 


val 


Leu Val 


Thr 


He 


Val Asp Leu Leu Val 


785 










790 








795 


800 


Thr 


Val 


Leu 


Asn 


Leu 


Leu 


Ala 


lie Ser 


Leu 


Gly 


Tyr Phe Gly Pro Lys 










605 








810 




815 


Cys 


Tyr 


Met 


He 


Leu 


Phe 


Tyr 


Pro Glu 


Arg 


Asn 


Thr Pro Ala Tyr Phe 








820 








825 






830 


Asn 


Ser 


Met 


He 


Gin 


Gly 


Tyr 


Thr Met 


Arg 


Arg 


Asp 






835 










840 









<210> 18 
<211> 855 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; note » 
synthetic construct 



<400> 18 


















Met 


Leu 


Gly 


Pro 


Ala 


Val 


Leu 


Gly Leu 


Ser 


Leu 


Trp Ala Leu Leu His 


1 








5 








10 




15 


Pro 


Gly 


Thr 


Gly 


Ala 


Pro 


Leu 


Cys Leu 


Ser 


Gin 


Gin Leu Arg Met Lys 








20 








25 






30 


Gly 


Asp 


Tyr 


Val 


Leu 


Gly 


Gly 


Leu Phe 


Pro 


Leu 


Gly Glu Ala Glu Glu 






35 










40 






45 


Ala 


Gly 


Leu 


Arg 


Ser 


Arg 


Thr 


Arg Pro 


Ser 


Ser 


Pro . Val Cys Thr Arg 




50 










55 








60 


Phe 


Ser 


Ser 


Asn 


Gly 


Leu 


Leu 


Trp Ala 


Leu 


Ala 


Met Lys Met Ala Val 


65 










70 








75 


80 


Glu 


Glu 


He 


Asn 


Asn 


Lys 


Ser 


Asp Leu 


Leu 


Pro 


Gly Leu Arg Leu Gly 










85 








90 




95 


Tyr 


Asp 


Leu 


Phe 


Asp 


Thr 


Cys 


Ser Glu 


Pro 


Val 


Val Ala Met Lys Pro 








100 








105 






110 


Ser 


Leu 


Met 


Phe 


Leu 


Ala 


Lys 


Ala Gly 


Ser 


Arg 


Asp He Ala Ala Tyr 






115 










120 






125 


Cys 


Asn 


Tyr 


Thr 


Gin 


Tyr 


Gin 


Pro Arg 


val 


Leu 


Ala Val He Gly Pro 




130 










135 








140 


His 


Ser 


Ser 


Glu 


Leu 


Ala 


Met 


Val Thr 


Gly 


Lys 


Phe Phe Ser Phe Phe 


145 










150 








155 


160 


Leu 


Met 


Pro 


Gin 


Val 


Ser 


Tyr 


Gly Ala 


Ser 


Met 


Glu Leu Leu Ser Ala 










165 








170 




175 


Arg 


Glu 


Thr 


Phe 


Pro 


Ser 


Phe 


Phe Arg 


Thr 


Val 


Pro Ser Asp Arg Val 








180 








185 






190 
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Leu 


Thr 


Ala 

Aia 


Ala 


Ala 


uXU 


Leu 
















o n n 
z u u 


\Tsx 1 
VGi± 


A±di 


A±a 


Leu 


til "»*• 
viJLy 


Ser 


Asp 


Asp 




Z±\J 










2 15 






Ji'iie 


Ser 


AXdi 


Leu 


Ala 


Ala 

Aia 


Aia. 












O "J A 

230 






Gly 


Leu 


Val 


Pro 


Leu 


Pro 


Arg 


Ala 










245 








Gin 


Asp 


Val 


Leu 


His 


Gin 


Val 


Asn 








2d0 










Leu 


Pne 


Ala 


Ser 


Val 


His 


Ala 


IVl B 

Ala 






n ^7 c 

275 










2o0 


Sex* 


Ser 


Arg 


Leu 


ser 


pro 


Lys 


vai 




o on 














Thx 


Ser 


Asp 


Leu 


Val 


Met 


Gly 


Leu 


n c: 










310 






Vai 


Leu 


Gly 


Pne 


Leu 


Gin 


TV 

Arg 


Gly 










325 








Tyr 


Val 


Lys 


Thr 


His 


Leu 


Ala 


Leu 








340 










Ala 


Leu 


Gly 


Glu 


Arg Glu 


Gin 


Gly 






^ c e 
o55 












Arg 


Cys 


Pro 


Gin 


Cys Asp 


Cys 


lie 




n n A 

370 










375 




Leu 


Asn 


His 


HXS 


Gin 


Thr 


Pne 


ser 


o o c 










390 






Ala 


Gin 


Ala 


Leu 


His 


Asn 


Thr 


Leu 










405 








Ala 


Gin 


Asp 


. Pro 


Val Lys 


Pro 


Trp 








420 










Leu 


Thr 


Phe 


His 


Val Gly 


Gly 


Leu 






>i *s e 

435 










A A n 

440 


Asn 


Val 


Asp 


Met 


Glu Tyr 


Asp 


Leu 




4b0 










/ICC 

455 




vax 


Pro 


Arg 


Leu 


His Asp 


•vai 


Gly 












470 






Glu 


Arcf 


Leu 


Lys 


lie Arg 


Trp 


XT J ta 
HIS 










485 








Ser 


Arg 


Cys 


Ser 


Arg Gin 


Cys 


Gin 








C A A 
500 










Gly 


Phe 


His 


Ser 


Cys 


Cys 


Tyr 


Asp 






515 










CO A 

520 


Tyr 


Arg 


Gin 


Asn 


Pro Asp 


Asp 


He 




C "S A 










5ij5 




Glu 


Trp 


Ser 


Pro 


Glu Arg 


Ser 


Thr 


545 










550 






Pne 


Leu 


Glu 


Leu 


Ala Trp 


Gly 


Glu 










565 








Leu 


Leu 


cys 


Leu 


Val 


Leu 


Giy 


Leu 








c; an 










Val 


His 


Tyr 


Trp 


Asp Ser 


Pro 


Leu 






595 










600 


Phe 


Cys 


Phe 


Gly 


Leu 


He 


Cys 


Leu 




610 










615 




Leu 


Phe 


Pro 


Gly 


Arg Pro 


Arg 


Ser 


625 










630 






Met 


Ala 


His 


Leu 


Pro 


Leu 


Thr 


Gly 



Leu 


V7in 


V7lU 


Fne 


Giy 


Trp 


Asn Trp 










2 U3 








Pill 

vjrlU 


Tyr 


vjiy 


Arg 


r"! n 

vjin 


Vjiy 


Leu 


Ser 








22 u 










Arg 


vaiy 


lie 


Cys 


lie 


Ala 


His 


Glu 






•Jit; 

2 JO 










240 


ASp 


Asp 


Ser 


Arg 


Leu 


uiy 


Lys Val 




29U 










255 




Gin 


Ser 


Ser 


Val 


Gin 


vai 


Val 


Leu 


o ^ e 
265 










OTA 
270 






174 a 

HIS 


Ala 

Ala 


Leu 


jrne 


TV Mwft 

Asn 


Tyr 


Ser 


He 










O 0 c 
2 05 








Trp 


vai 


"At a 

Aia 


Ser 


GIU 


TV 1 a 

Aia 


Trp 


Leu 








J U U 










Pro 


vjiy 


wee 


a1 a 

Aia 


r«i T-i 

Csin 


raec 


Gly Thr 
















320 


Aia 


■vjin 


Leu 


HIS 


r^l 11 
GIU 


Fne 


Pro 


Gin 




ion 










335 




Ala 


Thr 


TV 

ASp 


Pro 


TV"! a 

Ala 


Pne 


Cys 


Ser 


j45 










"a c A 
j5U 






Leu 


Glu 


Glu 


Asp 


Val 


Val 


Gly Gin 










O f c 

3o5 








ml. 

Thr 


Leu 


Gin 


TV «w« 

Asn 


vai 


Ser 


Ala Gly 








ion 










vai 


Tyr 


Ala 


Aia 


vai 


Tyr 


Ser 


Val 






395 










400 


Gin 


Cys 


Asn 


Ala 


Ser 


Gly 


Cys 


Pro 




410 










415 




Gin 


Leu 


Leu 


Glu 


Asn 


Met 


Tyr Asn 


425 










430 






Pro 


Leu 


Arg 


Phe 


Asp 


Ser 


Ser 


Gly 










A A C 

445 








Lys 


Leu Trp 


Val 


Trp 


Gin 


Gly 


Ser 








>l ^A 

4oU 










Arg 


Phe 


Asn 


m mm 

Gly 


ser 


Leu 


Arg 


Thr 






475 










480 


Thr 


Ser Asp 


Asn 


Gin 


Lys 


Pro 


Val, 




490 










495 




GIU 


Gly Gin 


vai 


TV i^<^ 

Arg 


TV 

Arg 


Val 


Lys 


CAE 

505 










CIA 

510 






cys 


Val 


Asp 


cys 


Glu 


TV 1 a 

Ala 


Gly 


Ser 










525 








Ala 


Cys Thr 


Pile 


Cys 


Gly 


Gin 


Asp 








C A A 

54U 










Arg 


Cys 


Phe 


TV irj-ii 

Arg 


Arg 


TV Tj-j-g 

Arg 


Ser 


Arg 






555 










560 


Pro 


Ala Val 


Leu 


Ser 


Leu 


Leu 


Leu 




570 










575 




Thr 


Leu 


Ala 


TV 1 a 

Ala 


Leu 


Gly 


Leu 


Phe 


9 O 3 










on 






Val 


Gin 


Ala 


Ser 


Gly 


Gly 


Ser 


Leu 










605 








Gly 


Leu 


Phe 


Cys 


Leu 


Ser 


Val 


Leu 








620 










Ala 


Ser 


Cys 


Leu 


Ala 


Gin 


Gin 


Pro 






635 










640 


Cys 


Leu 


Ser 


Thr 


Leu 


Phe 


Leu 


Gin 
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645 



&1 9 

AXo. 


al a 

HJ.CL. 


urXU 


T1 Dhca Val nlll 
XXc Jcrllc VaX V3XU 


OCX 








A 

OOU 




Trp 


Leu 


cys 


Ser Tyr Leu Arg 


f3l w 

bxy 










OOU 


lieu 


AXa 




Xieu VaX uXu AXa 


a la 
Axa 














Pro 




flilti Val Val Thr 
uxu vaj. vax xixx 


Asp 


•7 n 






f XU 




Leu 


ijXU 


£1X0 


uy S Ax g ric u AX y 


i9v7X 








•7 0 K 








Asn 


al a Val TiOii Hi a 
AXa VaX JUCU AXa 










74U 






vsxn 


Seir 


urXn irxO uxy Axg 


xyr 






e e 

755 




/ DU 


A J. a. 


rdeu 


Leu 


AXa xyr Jrue xxe 


Tl 9 

xxe 




770 




775 




AX a 


Asn 


vax 


r^ln Val &1 a TW^t* 
V7Xn VaX AXa lyx 


f3l n 


785 






790 




Leu 


Phe 


Cys 


Ala Leu Gly He 


Leu 








805 




Tyr 


Val 


Leu 


Leu Trp Leu Pro 


Glu 








820 




Gly 


Arg 


Ser 


Pro Lys Glu Ala 


Ser 






835 


» 


840 


Ala 


Thr 


Arg 


Gly His Ser. Glu 






850 




855 





<210> 19 
<211> 859 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial 
. synthetic construct 



<400> 19 



Met 


Pro 


Gly 


Leu 


Ala 


He 


Leu 


Gly 


1 








5 








Leu 


Gly 


Met 


Gly 
20 


Ser 


Ser 


Leu 


Cys 


Gly 


Asp 


Tyr 
35 


He 


Leu 


Gly 


Gly 


Leu 
40 


Ala 


Thr 
50 


Leu 


Asn 


Gin 


Arg 


Thr 
55 


Gin 


Phe 


Ser 


Pro 


Leu 


Gly 


Leu 


Phe 


Leu 


65 










70 






Glu 


Glu 


He 


Asn 


Asn 

85 


Gly 


Ser 


Ala 


Tyr 


Asp 


Leu 


Phe 
100 


Asp 


Thr 


Cys 


Ser 


Ser 


Leu 


Met 
115 


Phe 


Met 


Ala 


Lys 


Val 
120 


Cys 


Asn 
130 


Tyr 


Thr 


Gin 


Tyr 


Gin 
135 


Pro 


His 


Ser 


Ser 


Glu 


Leu 


Ala 


Leu 


He 



















vaXU 


Leu 


Pro 


Leu 


Gav 

oer 


Trp 


Al a 
AXa 


Asn 


005 










fi7n 






Pro 


Trp 


Axa 


Trp 


Leu 


Val 

VaX 


Val 
VaX 


Leu 










D09 








Leu 


Cys 


ill a 

Axa 


Trp 


Tyr 


Leu 


neu 


2Vl a 
AXa 








700 










Trp 


uxn 


Val 
VaX 


Leu 




XAXX 


wXU 


Val 
vax 






1 X3 










720 


Trp 


Val 

vax 


Ser 


Leu 


rjl \r 


Xi6U 


Val 
VaX 


nXo 




/ J u 










7*^ 
/ OD 




Leu 


Cys 


r'ne 


Leu 


rtl V 


XXlx 


JrXie 


Leu 


















Asn 


Arg 


& 1 a 

Axa 


Arg 


^1 
laXy 


Leu 


Tor 


irne 










7o5 








m 

Trp 


vax 


Ser 


Fne 


Val 

vax 


Pro 


Leu 


Leu 








/ ou 










Pro 


Ala 


Val 


Gin 


Met 


Gly 


Ala 


He 






795 










800 


Ala 


Thr 


Phe 


His 


Leu 


Pro 


Lys 


Cys 




810 










815 




Leu 


Asn 


Thr 


Gin 


Glu 


Phe 


Phe 


Leu 


825 










830 






Asp 


Gly 


Asn 


Ser 


Gly 


Ser 


Ser 


Glu 










845 








Sequence; note a 








Leu 


Ser 


Leu 


Ala 


Ala 


pne 


Leu 


r»i i'i 
oXU 




10 










xo 




Leu 


Ser 


Gin 


Gin 


Phe 


Lys 


M a 

Axa 


ni n 


25 
















Phe 


Pro 


Leu 


Gly 


Thr 


inr 


ni i« 

liXU 


1^1 11 

uXU 










45 








Pro 


Asn 


Gly 


He 


Leu 


Cys 


Thr 


1i -ir 

Arg 








60 










Ala 


Met 


Ala 


Met 


Lys 


net 


Al -» 

Ala 


vax 






75 










a A 
oO 


Leu 


Leu 


Pro 


Gly 


Leu 


Arg 


Leu 


Gly 




90 














Glu 


Pro 


Val 


Val 


Thr 


Met 


Lys 


Pro 


105 










110 






Gly 


Ser 


Gin 


Ser 


He 


Ala 


Ala 


Tyr 










125 








Arg 


Val 


Leu 


Ala 


Val 


He 


Gly 


Pro 








140 










Thr 


Gly 


Lys 


Phe 


Phe 


Ser 


Phe 


Phe 
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Ij6U 


Met 


Pro 


isln 


Arg 


Glu 


Thr 


Phe 








180 


Ol n 




Gin 


Ala 






1 OR 




Va.± 




211 a 


Leu 










Tl A 


irlie 


Ser 


vjxy 










wxy 


T>A11 
JJC LI 


V ox 




Val 


Asp 


Val 


Leu 








260 




flic 


Ala 












Ij6U 




Asp 


Leu 


















^ n c 










Leu 


\j±y 




Tyr 


Val 


Glu 


Thr 








340 


Sgit 


Leu 


Lys 


AT a 






1 K R 
ODD 




Cys 


Ser 




Cys 




J /u 






rlCU 


V7>Lxl 
















Ala 


Ala 


Val 


Cys 


Asn 


Val 


Ser 








420 


liGU 


Leu 


\J± u. 


Abu 










XJCU 


m n 


Ph@ 


Asn 




A e n 






Met- 


irp 


Val 
vax 














TV an 






Asn 


Qln 


Val 


Pro 








500 


Val 
VaX 


Arg 


Arg 


Val 












uyo 


Ala 


Gly 




CO A 






Pro 


Cys 


Gly 


Lys 


545 








Pro 


Arg 


Arg 


Pro 


Leu 


Leu 


Leu 


Leu 








580 


Ala 


Leu 


Gly 


Leu 






595 







1 






vax 


oer 


Tyr 


OCX 


loo 








Pro 


ser 


pne 


jrne 


Val 


vai 


Tur 


Leu 








200 


Gly 


Ser 


Asp 


Asp 










Leu 


Ala 


Asn 


ser 




*i ii A 
23 0 






Gin 


nlS 


Asp 


ixir 


245 








Arg 


Gin 


vai 


Asn 


Ala 


Arg 


Ala 


vai 








280 


Ser 


Pro 


Lys 


Val 






295 




Val 


Met 


Thr 


Leu 




OTA 






Leu 


Gin 


Arg 


Gly 


325 








Arg 


Leu 


Ala 


lieu 


Glu 


Leu 


Asp 


Leu 








360 


Asp 


Tyr 


He 


Met 




s 


0 1 c 
375 




Ser 


Ala 


Gly 


Gin 




O OA 






Tyr 


Ser 


vai 


Ala 


405 








His 


Cys 


His 


Thr 


Met 


Tyr 


Asn 


Met 








440 


Ala 


Lys 


Gly 


Ser 






A C C 




Gin 


Ser 


Pro 


Tnr 










Leu 


Gin 


Leu 


Gin 










Val 


Ser 


Gin 


Cys 


Lys 


Gly 


pne 


His 








520 


Ser 


Tyr 


Arg 


Lys 






535 




Asp 


Gin 


Trp 


Ser 




550 






Lys 


Phe 


Leu 


Glu 


565 








Leu 


Leu 


Leu 


Ser 


Phe 


Val 


His 


His 



600 



155 



Ala 


Ser 


Met Asp 




X / w 




Aro 


Thr 


Val Pro 


185 






Leu 


Gin 


Asn. Phe 


Asp 


Tyr 


Gly Arg 






220 


Arg 


m V 

vjxy 


xxc v«yD 






^•.9 3 


OCX 


Glv 

VJX jr 


Qln Gin 








m n 

wXix 


Ser 


Tivs Val 
0 -V ax 


4bOD 








OCX 


TiAii Pli^ 
AiC U f u>c 


Trp 


Val 


Ala Ser 






300 


Pro 


Asn 


xXc Axa 






3 ±3 


Ax a 


Leu 


JjcU JrxO 




0 0 n 




Al a 




TV ciT^ T3iK*/*\ 
IrxU 










r&l 11 

wxU 


Ar-o Va1 


Leu 


Gin 


Asn Leu 






380 


Leu 


TT-t a 


XxX 0 wXU 






OQi; 


V7xll 


&1 9 

Axa 


JJCU IlxB 




AT A 

4 JLU 




OCX 


OX u. 


Pfn Val 

Ir±. LJ vax 








Ser 


Phe 


Arg Ala 


Val 


Asp 


Met Glu 






460 




Val 
veix 


XICLl AlXS 






4/3 


IlxB 


OCX 


Tt\rct 

U D I'iC U 




AO A 




OCX 




Ol n PvH 


C A C 
3U3 






OCl. 


uye 


^jro xjrx 


His 


Pro 


Asp Asp 






540 


Pro 


Glu 


Lys Ser 






555 


Leu 


Ala 


T3np Gly 




570 




Leu 


Ala 


Leu Gly 


585 






Arg 


Asp 


Ser Pro 



160 





Leu 


Ser 


Asp 






175 




Ser 


Asp 


Arcr 


Val 




T QA 






OCX 


irp 


Asn 


xrp 


^ AR 

AV3 








Gin 

V7JLU 


Glv 


XICU 


OCX 


He 


Ala 


His 


Glu 








240 


XJCU 


Glv 
ox jr 


TiVfl 

XJ JF 0 


Val 
V ax 






0 cc 




GT n 


Val 
vax 


Val 
Vax 


Val 
vax 




0*7 A 






Cot- 
OCX 


Tyr 


Ser 


Tl A 

XXc 










Gl 11 


Ser 


xrp 


Leu 


Arg 


Val 


Gly 


Thr 








320 


(iXU 


Fne 


Ser 


UxS 






0 Q C 

3 3 3 




xnr 


DVi A 


Cys 


n 1 a 
AX a 




OCA 






ncu 


rtl V 
wxy 


Pro 


Arg 


o^c 








Q Al^ 

OCX 


C A 

OCX 


Gl V 


XJCU 


He 


Phe 


Ala 


Thr 








400 


Asn 


xxxx 


Leu 


Gl n 
0x11 






A T C 
4 J.3 




Gin 


XrX w 




Gin 

ox XI 




A 0 A 

43 0 








Asn 


XICU 


Thr 












Ann 


XJCU 


xiy 0 


Thr 


Val 


Gly 


Thr 








480 


Tyr 


Trp 


Pro 


Glv 

oxy 






433 




Lys 


Asp 


ral V 
o±y 


nl n 




CIA 
3 J.0 






Asp 


Cys 


Va 1 
Vax 


Asp 


3^3 








irne 


mr 


Cys 


xnr 


Thr 


Thr 


Cys 


Leu 








560 


Glu 


Pro 


Ala 


Val 






575 




Leu 


Val 


Leu 


Ala 




590 






Leu 


Val 


Gin 


Ala 



60S 



24 



wo 2005/015158 



PCTAJS2004/025459 



Ser 


Gly 


Gly Pro Leu Ala 


Cys 


Phe 




610 




615 




Cys 


Leu 


Ser Val Leu Xteu 


Phe 


Pro 


625 




630 






Leu 


Ala 


Gin Gin Pro Leu 


Ser 


His 






645 






Thr 


Leu 


Phe Leu Gin Ala 


Ala 


Glu 






660 






Leu 


Ser 


Trp Ala Asp Arg 


Leu 


Ser 






675 




680 


Trp 


Leu 


Val Val Leu Leu 


Ala 


Met 




690 




695 




Tro 


Tvr 


Leu Val Ala Phe 


Pro 


Pro 


705 




710 






Leu 


Pro 


Thr Glu Ala Leu 


Val 


His 






725 






Phe 


Glv 


Leu Ala His Ala 


Thr 


Asn 






740 






Leu 


Glv 


Thr Phe Leu Val 


Ara 


Ser 






755 




760 




Glv 


Leu Thr Phe Ala 


Met 


Leu 




770 




775 




Phe 


Val 


Pro Leu Leu Ala 


Asn 


Val 


785 




790 






Gin 


Met 


Gly Ala Leu Leu 


Leu 


Cys 






805 






His 


Leu 


Pro Arg Cys Tyr 


Leu 


Leu 






820 






Pro 


Glu 


Phe Phe Leu Gly 


Gly 


Gly 






835 




840 


Asp 


Gly 


Asn Thr Gly Asn 


Gin 


Gly 




850 




655 





<210> 20 
<211> 841 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial 
synthetic construct 



<400> 20 



Met 


Leu 


Leu 


Cys 


Thr 
5 


Ala 


Arg 


Leu 


1 
Cys 


Cys 


Trp 


Ala 


Phe 


Ala 


Cys 


His 








20 










Thr. 


Leu 


Pro 


Gly 


Asp 


Tyr 


Leu 


Leu 






35 










40 


Gly 


Cys 


Leu 


Gin 


Val 


Arg 


His 


Arg 




50 










55 




Ser 


Cys 


Ser 


Phe 


Asn 


Glu 


His 


Gly 


65 










70 






Leu 


Gly 


Val 


Glu 


Glu 


lie 


Asn 


Asn 










85 








Thr 


Leu 


Gly 


Tyr 


Gin 


Leu 


Tyr 


Asp 








100 











Gly 


Leu 


Val 


Cys 


Leu 


Gly 


Leu 


Val 








620 










Gly 


Gin 


Pro 


Ser 


Pro 


Ala 


Arg 


Cys 






635 










640 


Leu 


Pro 


Leu 


Thr 


Gly 


Cys 


Leu 


Ser 




650 










655 




lie 


Phe 


Val 


Glu 


Ser 


Glu 


Leu 


Pro 


665 










670 






Gly 


Cys 


Leu 


Arg 


Gly 


Pro 


Trp 


Ala 










685 








Leu 


Val 


Glu 


Val 


Ala 


Leu 


Cys 


Thr 








700 










Glu 


Val 


Val 


Thr 


Asp 


Trp 


His 


Met 






715 










720 


Cvs 


Aro 


Thr 


Arcr 


Ser 


Trp 


Val 


Ser 




730 










735 




Ala 


Thr 


Leu 


Ala 


Phe 


Leu 


Cvs 


Phe 


745 










750 






Gin 


Pro 


Gly 


Arcr 


Tvr 


Asn 


Aro 


Ala 










765 








Ala 


Tvr 


Phe 


Xle 


Thr 


Trp 


Val 


Ser 








780 










Gin 


Val 


Val 


Leu 


Arg 


Pro 


Ala 


Val 






795 










800 


Val 


Leu 


Gly 


He 


Leu 


Ala 


Ala 


Phe 




810 










815 




Met 


Arg 


Gin 


Pro 


Gly 


Leu 


Asn 


Thr 


825 










830 






Pro 


Gly 


Asp 


Ala 


Gin 


Gly 


Gin 


Asn 



845 



Lys His Glu 



Sequence; note « 



Val 


Oly 

10 


Leu 


Gin 


Leu 


Leu 


He 

15 


Ser 


Ser 


Thr 


Glu 


Ser 


Ser 


Pro 


Asp 


Phe 


25 










30 






Ala 


Gly 


Leu 


Phe 


Pro 
45 


Leu 


His 


Ser 


Pro 


Glu 


Va:l 


Thr 
60 


Leu 


Cys 


Asp 


Arg 


Tyr 


His 


Leu 
75 


Phe 


Gin 


Ala 


Met 


Arg 
80 


Ser 


Thr 
90 


Ala 


Leu 


Leu 


Pro 


Asn 
95 


He 


Val 


Cys 


Ser 


Asp 


Ser 


Ala 


Asn 


Val 


105 










110 







25 
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Tyr 


Ala 


Thr 


Leu 


Arg 


Val 


Leu 


Ser 


Leu 


Pro 


Qiy 


Gin 


HIS 


nlS 


lie 


GIU 






115 










ion 
120 


















Leu 


Gin 


Gly 


Asp 


Leu 


Leu 


His 


Tyr 


Ser 


Pro 


Thr 


vai 


Leu 


ilia 


vai 


lie 




130 










IOC 
1J5 




















Gly 


Pro 


Asp 


Ser 


Tnr 


Asn 


Arg 


i\ia 


iu.ai 


lUM 




ai a 


ai a 


Leu 


Leu 


Ser 












ten 
150 






















Pro 


Pne 


Leu 


val 


Pro 


Met 


lie 


ber 


Tyr 






Ser 


Got* 
OCX 


vrXU 




Leu 










165 










X / o 










X / 5 




Ser 


Val 


Lys 


Arg 


Gin 


Tyr 


Pro 


Ser 


Irue 


Leu 


Arg 


Tnr. 




Pro 


Ash 


Asp 








180 










1 Q C 

lo5 










X70 






Lys 


Tyr 


Gin 


Val 


Glu 


Thr 


Met 


Val 


Leu 


Leu 


Leu 


Qin 


Lys 


pne 


Qiy 


Trp 






195 










200 










one 
205 








Thr 


Trp 


lie 


Ser 


Leu 


Val 


Gly 


Ser 


Ser 


Asp 


Asp 


Tyr 


Giy 


Gin 


Leu 


Gly 




210 










215 










u 










Val 


Gin 


Ala 


Leu 


Glu 


Asn 


Gin 


Ala 


inr 


\jriy 


oin 


vjiy 


i.i.e 


Cys 


Tl *a 

lie 


iiia 


o o c 

22b 










O O 

2 J U 










9 0 R 












Phe 


Lys 


Asp 


lie 


Met 


Pro 


Pne 


Ser 


Ala 


Gin 


vai 


vaiy 


Asp 


GIU 


TV 

Arg 


nec 










o A e 
245 










Z5U 














Gin 


Cys 


Leu 


Met 


Arg 


His 


Leu 


Ala 


Csin 


Aia 


Qiy 


iu.a 


inr 


vai 


vax 


vai 








260 










O £ c 

2o5 










2/0 






Val 


Phe 


Ser 


Ser 


Arg 


Ml mm. 

Gin 


Leu 


Ala 


Arg 


Val 


Phe 


Pne 


GIU 


Ser 


Val 


Val 






275 










280 










265 








Leu 


Thr 


Asn 


Leu 


Thr 


Gly 


Lys 


Val 


Trp 


Val 


Ala 


Ser 


Glu 


Ala 


Trp 


Ala 




290 










295 










9 A A 

300 










Leu 


Ser 


Arg 


His 


lie 


Thr 


Gly 


Val 


Pro 


Gly 


He 


Gin 


Arg 


He 


Gly 


Met 


305 










310 










315 










0 0 A 

320 


Val 


Leu 


Gly 


Val 


Ala 


He 


Gin 


Lys 


Arg 


Ala 


Val 


Pro 


Gly 


Leu 


Lys 


Ala 










325 










•a "5 A 
330 










335 




Phe 


Glu 


Glu 


Ala 


Tyr 


Ala 


Arg 


Ala 


Asp 


Lys 


Lys 


Ala 


Pro 


Arg 


Pro 


Cys 








340 










345 










^ e A 
350 






His 


Lys 


Gly 


Ser 


Trp 


Cys 


Ser 


Ser 


Asn 


Gin 


Leu 


Cys 


Arg 


Glu 


Cys 


Gin 






355 










360 










o f e 
3o5 








Ala 


Phe 


Met 


Ala 


His 


Thr 


Met 


Pro 


Lys 


Leu 


Lys 


Ala 


Phe 


Ser 


Met 


Ser 




370 










375 










O O A 

3o0 










Ser 


Ala 


Tyr 


Asn 


Ala 


Tyr 


Arg 


Ala 


Val 


Tyr 


Ala 


Val 


Ala 


His 


Gly 


Leu 


385 




















0^75 










Ann 


His 


Gin 


Leu 


Leu 


Gly 


Cys 


Ala 


Ser 


Gly 


TV 1 — 

Ala 


Cys 


Ser 


Arg 


Gly 


Arg 


vai 










405 










410 










415 




Tyr 


Pro 


Trp 


Gin 


Leu 


Leu 


Glu 


Gin 


lie 


His 


Lys 


vai 


HIS 


pne 


Leu 


Leu 






420 










A*D R 

425 










A "1 n 






His 


Lys 


Asp 


Thr 


Val 


Ala 


Pne 


Asn 


Asp 


Asn 


TV iv*/^ 

Arg 


Asp 


Pro 


Leu 


Ser 


Ser 






>l ^ c 

435 










AAA 


















Tyr 


Asn 


lie 


lie 


Ala 


Trp 


Asp 


Trp 


TVsn 


Gly 


Pro 


Lys 


Trp 


Thr 


Pne 


Thr 




450 










>i c e 
455 










A^n 










Val 


Leu 


Gly 


Ser 


Ser 


Thr 


Trp 


Ser 


Pro 


Val 


Gin 


Leu 


Asn 


lie 


Asn 


Glu 


465 










470 










A rf e 
475 










4o0 


Thr 


Lys 


lie 


Gin 


Trp 


His 


Gly 


Lys 


Asp 


Asn 


Gin 


Val 


Pro 


Lys 


Ser 


Val 










485 










490 










495 




Cys 


Ser 


Ser 


Asp 


Cys 


Leu 


Glu 


Gly 


His 


Gin 


Arg 


Val 


Val 


Thr 


Qiy 


Phe 








c; no 




















510 






His 


His 


Cys 


Cys 


Phe 


Glu 


Cys 


Val 


Pro 


Cys 


Gly 


Ala 


Gly 


Thr 


Phe 


Leu 






515 










520 










525 








Asn 


Lys 


Ser 


Asp 


Leu 


Tyr 


Arg 


Cys 


Gin 


Pro 


Cys 


Gly 


Lys 


Glu 


Glu 


Trp 




530 










535 










540 










Ala 


Pro 


Glu 


Gly 


Ser 


Gin 


Thr 


Cys 


Phe 


Pro 


Arg 


Thr 


Val 


Val 


Phe 


Leu 


545 










550 










555 










560 


Glu 


Trp 


His 


Glu 


Pro 


He 


Ser 


Leu 


Val 


Leu 


He 


Ala 


Ala 


Asn 


Thr 


Leu 
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565 570 575 

Leu Leu Leu Leu Leu Val Gly Thr Ala Gly Leu Phe Ala Trp His Phe 

580 585 590 

His Thr Pro Val Val Arg Ser Ala Gly Gly Arg Leu Cys Phe Leu Met 

595 600 605 

Leu Gly Ser Leu Val Ala Gly Ser Cys Ser Phe Tyr Ser Phe Phe Gly 

610 615 620 

Qlu Pro Thr Val Pro Ala Cys Leu Leu Arg Gin Pro Leu Phe Ser Leu 
625 630 635 640 

Gly Phe Ala He Phe Leu Ser Cys Leu Thr lie Arg Ser Phe Gin Leu 

645 650 655 

Val He He Phe Lys Phe Ser Thr Lys Val Pro Thr Phe Tyr Arg Thr 

660 665 670 

Trp Ala Gin Asn His Gly Ala Gly Leu Phe Val He Val Ser Ser Thr 

675 680 685 

Val His Leu Leu He Cys Leu Thr Trp Leu Val Met Trp Thr Pro Arg 

690 695 700 

Pro Thr Arg Olu Tyr Gin Arg Phe Pro His Leu Val He Leu Glu Cys 
705 710 715 720 

Thr Glu Val Asn Ser Val Gly Phe Leu Leu Ala Phe Thr His Asn He 

725 730 735 

Leu Leu Ser He Ser Thr Phe Val Cys Ser Tyr Leu Gly Lys Glu Leu 

740 745 750 

Pro Glu Asn Tyr Asn Glu Ala Lys Cys Val Thr Phe Ser Leu Leu Leu 

755 760 765 

Asn Phe Val Ser Trp He Ala Phe Phe Thr Met Ala Ser He Tyr Gin 

770. 775 780 

Gly Ser Tyr Leu Pro Ala Val Asn Val Leu Ala Gly Leu Thr Thr Leu 
785 790 795 800 

Ser Gly Gly Phe Ser Gly Tyr Phe Leu Pro Lys Cys Tyr Val He Leu 

805 810 815 

Cys Arg Pro Glu Leu Asn Asn Thr Glu His Phe Gin Ala Ser He Gin 

820 825 830 

Asp Tyr Thr Arg Arg Cys Gly Thr Thr 
835 840 

<210> 21 
<211> 840 
<212> PRT 

<213> Artificial Seq[uence 
<220> 

<223> Description of Artificial Sec[uence; note » 
synthetic construct 



<400> 21 














Met Leu Phe 


Trp 


Ala Ala His 


Leu Leu 


Leu 


Ser 


Leu Gin Leu Val Tyr 


1 




5 




10 




15 


Cys Trp Ala 


Phe 


Ser Cys Gin 


Arg Thr 


Glu 


Ser 


Ser Pro Gly Phe Ser 




20 




25 






30 


Leu Pro Gly 


Asp 


Phe Leu Leu 


Ala Gly 


Leu 


Phe 


Ser Leu His Gly Asp 


35 






40 






45 


Cys Leu Gin 


Val 


Arg His Arg 


Pro Leu 


Val 


Thr 


Ser Cys Asp Arg Pro 


50 




55 








60 


Asp Ser Phe 


Asn 


Gly His Gly 


Tyr His 


Leu 


Phe 


Gin Ala Met Arg Phe 


65 




70 






75 


80 


Thr Val Glu 


Glu 


He Asn Asn 


Ser Ser 


Ala 


Leu 


Leu Pro Asn He Thr 
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85 



JJCU 






Glu 






Asp 


Val 


Cvs 








100 










105 


Ala 




Leu 




Val 


Leu 


Ala 


Leu 


Gin 
















120 




Gin 


Lys 


Asp 


Leu 


Al"CT 


Asn 


His 


Ser 


Ser 




130 










135 






Piro 


Asp 


Asn 


Thr 


Asp 


His 


Ala 


Val 


Thr 


1 At; 










1 KO 
X3 V 








"□"hp* 


JJCLl 




JcrX \J 


JJCU. 


Val 

V OIX 


OCX 


lyx 


Glu 










JLOO 










Ma 


TiVSI 




Lys 


Phe 


Pro 


Ser 


Phe 


Leu 








1 on 










X09 


nXS 






11 

V9XU 


VaX 


1*16 


Val 
vcix 


Gl n 


T*aii 






J. 










■ a\JU 




Trp 








XX6 


Gl V 
oxy 


Ser 


lyx 


Glv 
oxy 




OT n 

Z xU 










aX3 












Glu 


Glu 

wX U 


JJCU 


Ala 


Val 


Pro 












9*^0 
w 










Asp 


JLXC 


VCLX 


IT I. \J 


XrlXC 


ocx 


Ala 

AXd 












OAK 














ricL. 


UXIl 


iixs 


XiCUl 


Al a 


Gin 


Ala 








O ^ A 












irUe 


SesT 


Asn 


Arg 


XIX D 


T.ail 


Ala 


At-ct 

Airg 


Val 






•> n c 

2/3 










o aft 
z oil 




IVl a 


Asn 


JJCU 


XJUX 


Olv 
wxy 


juy o 


Val 

V CiX 


xt^p 


Val 




'5 Oft 










^ 79 






SSIT 




iyr 


Tl» 

XXB 




Ser 


Val 
vax 


Thr 


Gly 


^ n cc 










1 ft 
J xu 










m V 

vsxy 


V CLX 


Ala 


Val 


Gin 


Gin 


nxy 


Gin 
























Q Al^ 

OCX 


iyr 


Val 
vax 


Axg 


Ala 

AX CI 


Val 

V eix 


Thr 








1 A f\ 
















Cot- 
OCX 


irp 




OCX 


X AIX 


Asn 


Gin 






o c c 










"5 Cft 




File 


Tnir 


inr 


Arg 


Asn 


Mat' 


Pro 


Tiny 

XXIX 


XICU 




*S *7 rt 

370 










3 /O 










Axg 


Val 

V dX 




Glu 


Ala 


Val 


Tvr 
xyx 


3 85 










390 








Gin 


Leu 


Leu 


Gly 


Cys 


Thr 


Ser 


Glu 


He 






















irp 


oXU 


JJCU 


XICU 


Gl n 

V3XXA 


Gin 


He 


xyx 








420 










425 


Glu 


Asn 


Thr 


Val 


Ala 


Phe 


Asp 


Asp 


Asn 






435 










440 




Asp 






a 


-Lxp 


Hem 




Afin 

AO XX 


Glv 




450 
















lie 


v3iy 


ser 


AX a 


Ser 


Leu 


Ser 


pro 


Val 
VaX 


465 








470 








Lys 


He 


Gin 


Trp 


His 


Gly 


Lys 


Asn 


Asn 










485 










Thr 


Thr 


Asp 


Cys 


Leu 


Ala 


Gly 


His 


His 








500 










505 


His 


Cys 


Cys 


Phe 


Glu 


Cys 


Val 


Pro 


Cys 






515 










520 
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90 95 



Ser 


Glu 


Ser 


Ala 


Asn 


Val 


Tvr 

xyx. 










110 






Glv 


Pro 


ArcT 


His 


He 


Glu 


He 








125 








LVS 


Val 


Val 


Ala 


Phe 


He 


Gly 






140 










Thr 


Ala 


Ala 


Leu 


Leu 


Gly 


Pro 




155 










160 


Ala 


Ser 


Ser 


Val 


Val 


Leu 


Ser 


1 7n 

X / V 










X / 




Axy 


Thr 


Val 


Pro 


Ser 


Asp 


Arcr 

Axy. 










1 QO 

X7 W 






Tkaii 

XJCU 


Gin 

wXXX 


OCX 


XrXXC 


Qlv 
vrxy 


Tno 
xxp 


Val 

V ax 
















Ann 


xyt 


Qlv 
wxy 


Gin 
wxxx 


Tieu 

XICU 


Glv 


Val 
V ax 
















Aty 


Glv 


He 


Cys 


Val 


Ala 


Phe 




<b O 9 










240 


Val 


Glv 
vjxy 


Asp 


Pro 


Axy 


Met 


Gin 
















Airo 
^w.y 


Thr 


Thr 


Val 


Val 


Val 


Val 










9*7 ft 






T>>ia 

IrXXC 


IrXXC 


Arg 


OCX 


Val 
V ax 


Val 
V ax 


xicu 
















Ala 
AX a 


OCX 


Glu 


Asp 


Tim 
xxp 


Ala 


He 






300 

•9 U U 










Tl 
xxc 


Gin 


Glv 
vaxy 


Tie 
xxc 


Glv 
wxy 


Thr 
xxxx 


Val 




J X 9 










320 


Val 


PT*r\ 

XrX w 


Glv 
v»xy 


Leu 


Lys 


Glu 


Phe 


330 

J J u 










335 




Ala 


Ala 


Pro 


Ser 


Ala 


Cys 


Pro 










3 1^ ft 
J 3 U 






XlCU 


^y w 


Arg 


GTu 

ox LX 


v-.y o 


His 


Thr 

XXIX 








3^^R 
J O 3 








Qlv 
vaxy 


Ala 


Phe 


Ser 


Met 


Ser 


Ala 






3 Aft 










Al a 
nXa 


Val 
Vax 


Al a 
nXa 


His 

AXO 


Glv 
\3xy 


xicu 


His 




395 










400 


Cys 


Ser 


Arg 


Gly 


Pro 


Val 


Tyr 


410 










415 




Xljr o 


Val 
■ vax 


Ann 

APXl 


Phe 

JrXXC 


Leu 


XJC u 


His 










430 






Gly 


Asp 


Thr 


Leu 


Gly 


Tyr 


Tyr 








445 








r^xv 


Glu 

wX LX 


Tim 


ThT 
xxxx 


Phe 


Glu 


Tie 
xxc 
















His 


Leu 


Asp 


He 


Asn 


Lys 


Thr 




475 










480 


Gin 


val 


Pro 


Val 


Ser 


Val 


Cys 


490 










495 




Arg 


Val 


Val 


Val 


Gly 


Ser 


His 










510 






Glu 


Ala 


Gly 


Thr 


Phe 


Leu 


Asn 



525 
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Met 


Ser 


Glu 


Leu 


His 


He 


Cys 


Gin 




530 










535 




Pro 


Lys 


Glu 


Ser 


Thr 


Thr 


Cys 


Phe 


545 










550 






Leu 


Arg 


Glu 


His 


Thr 


Ser 


Trp 


Val 










565 








Leu 


Leu 


Leu 


Leu 


Leu 


Gly 


Thr 


Ala 








580 










Thr 


Pro 


Val 


Val 


Arg 


Ser 


Ala 


Gly 






595 










600 


Gly 


Ser 


Leu 


Ala 


Ala 


Gly 


Ser 


Gly 




610 










615 




Pro 


Thr 


Arg 


Pro 


Ala 


Cys 


Leu 


Leu 


625 










630 






Phe 


Thr 


He 


Phe 


Leu 


Ser 


Cys 


Leu 










645 








He 


He 


Phe 


Lys 


Phe 


Ser 


Thr 


Lys 








660 










Val 


Gin 


Asn 


His 


Gly 


Ala 


Gly 


Leu 






675 










660 


Gin 


Leu 


Leu 


He 


Cys 


Leu 


Thr 


Trp 




690 










695 




Ala 


Arg 


Glu 


Tyr 


Gin 


Arg 


Phe 


Pro 


705 










710 






Glu 


Thr 


Asn 


Ser 


Leu 


Gly 


Phe 


He 










725 








^Leu 


Ser 


He 


Ser 


Ala 


Phe 


Ala 


Cys 








740 










Glu 


Asn 


Tyr 


Asn 


Glu 


Ala 


Lys 


Cys 






755 










760 


Phe 


Val 


Ser 


Trp 


He 


Ala 


Phe 


Phe 




770 










775 




Lys 


Tyr 


Leu 


Pro 


Ala 


Ala 


Asn 


Met 


785 










790 






Ser 


Gly 


Phe 


Gly 


Gly 


Tyr 


Phe 


Leu 










805 








Arg 


Pro 


Asp 


Leu 


Asn 


Ser 


Thr 


Glu 








820 










Tyr 


Thr 


Arg 


Arg 


Cys 


Gly 


Ser 


Thr 






835 










840 



<210> 22 
<211> 838 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial 
synthetic construct 

<400> 22 

Met Gly Pro Arg Ala Lys Thr He 

1 5 
Val Leu Ala Glu Pro Ala Glu Asn 
20 

Tyr Leu Leu Gly Gly Leu Phe Ser 
35 40 



Pro 


Cys 


Gly 


Thr 


Glu Glu 


Trp 


Ala 








540 








Pro 


Arg 


Thr 


Val 


Glu Phe 


Leu 


Glu 






555 








560 


Leu 


Leu 


Ala 


Ala 


Asn Thr 


Leu 


Leu 




570 








575 




Gly 


Leu 


Phe 


Ala 


Trp His 


Leu 


Asp 


585 








590 






Gly 


Arg 


Leu 


Cys 


Phe Leu 


Met 


Leu 










605 






Ser 


Leu 


Tyr 


Gly 


Phe Phe 


Gly 


Glu 








620 








Arg 


Gin 


Ala 


Leu 


Phe Ala 


Leu 


Gly 






635 








640 


Thr 


Val 


Arg 


Ser 


Phe Gin 


Leu 


He 




650 








655 




Val 


Pro 


Thr 


Phe 


Tyr His 


Ala 


Trp 


665 








670 






Phe 


Val 


Met 


lie 


Ser Ser 


Ala 


Ala 










685 






Leu 


Val 


Val 


Trp 


Thr Pro 


Leu 


Pro 








700 








His 


Leu 


Val 


Met 


Leu Glu Cys 


Thr 






715 








720 


Leu 


Ala 


Phe 


Leu 


Tyr Asn Gly 


Leu 




730 








735 




Ser 


Tyr 


Leu 


Gly 


Lys Asp Leu 


Pro 


745 








750 






Val 


Thr 


Phe 


Ser 


Leu Leu 


Phe 


Asn 










765 






Thr 


Thr 


Ala 


Ser 


Val Tyr Asp 


Gly 








780 








Met 


Ala 


Gly 


Leu 


Ser Ser 


Leu 


Ser 






795 








800 


Pro 


Lys 


Cys 


Tyr 


Val He 


Leu 


Cys 




810 








815 




His 


Phe 


Gin 


Ala 


Ser He 


Gin 


Asp 


825 








830 







Sequence; note = 



Cys Ser Leu Phe Phe Leu Leu Trp 

10 15 
Ser Asp Phe Tyr Leu Pro Gly Asp 
25 30 
Leu His Ala Asn Met Lys Gly He 
45 



29 
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Val 


His 


Leu 


Ash 


Phe 


Leu 


Gin 


Val 




50 










55 






Val 


Tie 
xxc 


Glv 




Asn 


Leu 


Met 


65 










70 






GlU 


Tie 




Ash 




Ser 


Ser 


Leu 










85 








Glu 


lie 


Val 


Asp 


Val 


Cys 


Tvr 


He 








100 










Tvx 


Phe 


Leu 


Ala 


His 


Glu 


Asp 


Asn 






115 










120 


Ser 


Ash 


±yt 


He 

xxc 


Ser 




Val 


Val 














135 




Glu 


OCX 


Val 

VAX 


I*1C u 


XXIX 


Val 

VAX 


Ala 

AXA 


Asn 


145 










150 






Gin 


xxc 


Thy 

X IIX 


'Wry 


AT* 
OCX 


Ala 

nx A 


Tie 

xxc 


Ser 










165 








Phe 


PSTO 


Ala 


Leu 


Leu 




Thr 


Thr 








180 










Ala 


Met 


Val 


Gin 


Leu 


Met 


Leu 


His 






195 










200 




Va 1 

VAX 


Cat* 
OCX 


OCX 


Ann 


Jixxx 


Tvr 


Glv 
«xy 




210 

^ X V 










215 




m n 


A2rcj 


Val 
VAX 


211 a 

AXA 


Arg 


Arg 




Tie 
xxc 












A .9 V 






PlTO 


lllx 


Leu 


ni n 

UXIl 


Pro 


Asn 


Gl n 

wXXX 


Aan 
no XX 










245 








Li6U 


Val 
Vox 


X IIX 


Tl e 
xxc 


Val 

V AX 




T.vd 
J 


TiAII 
XJC u 








260 












Val 

VAX 


XrXlC 


. OCX 


rrx v>/ 


Asp 


Leu 


Thr 






275 










280 


JJCU 




Gin 


Asn 


Phe 


Thr 


Glv 


Ala 




290 










295 






Tie 

X xc 


Ann 


XrX W 


Val 


Leu 


His 


Asn 


w w 9 










310 








p>t@ 
irxiB 


xtcu 


Glv 

wX jr 


He 


Thr 

XIXX 


Tie 

X xc 


Gin 










325 










JTIXC 




Gl 11 
\7X U 


Tyn 


Glv 


Pro 

<rX V/ 


Gin 








340 












OCX 


Gl n 


Q AT" 
OCX 




Thr 

XXIX 




Actn 

no XX 






355 










360 




XXlX 


XICLX 


Q AY* 
OCX 


pVia 
j> xxc 


Aan 

AD XX 


Thr 


He 




370 










375 




Val 


lyx 


OCX 


Val 

VAX 




Ser 


Ala 


Val 


385 










390 






Q A V 
OCX 


XICU 


ucu 


Glv 
wxy 


Cys 


A on 
Ad ^ 


TiVH 
xijr o 


Ser* 

OCX 










405 








Pz*o 


irp 


Gin 


ucu 


ucu 


Glu 

WXU 


Glu 

wX U 


Tie 
xxc 








4 A U 










Asp 


XlXS 


VJXIX 


Tl A 

Xxc 


PVia 
JrXlc 


PllA 

irxxc 


.Asp 


Pro 






435 










440 


Glu 


He 


Val 


Gin 


Trp 


Gin 


Trp 


Asp 




450 










455 




Val 


Ala 


Ser 


Tyr 


Tyr 


Pro 


Leu 


Gin 


465 










470 






lie 


Ser 


Trp 


His 


Thr 


val 


Asn 


Asn 










485 








Lys 


Arg 


Cys 


Gin 


Ser 


Gly 


Gin 


Lys 



Pro 


Met Cvs Lvs Glu 


Tvr Glu Val 




60 




Gin 


Ala Met Aro Phe 


Ala Val Glu 




75 


80 


Leu 


Pro Gly Val Leu 


Leu Olv Tvr 




90 


95 


Ser 


Asn. Asn Val Gin 


Pro Val Leu 


105 




110 


Leu 


Leu Pro He Gin 


Glu ASD Tvr 




125 




Ala 


Val He Glv Pro 


Asn Asn Ser 

«nM3*Jk OWj> 




140 




Phe 


Leu Ser Leu Phe 

MWW AifwU JFXA%^ 


Leu Leu Pro 

XJC U XJC iX f X w 




155 


160 


Asn 


Glu Tieu Arcr Asn 

NJXVX XJCWL XVL^ fXO^ 


T«v« X^a 1 AT"fT 
xjjrB VAX ax^ 




170 


175 


Pro 


Ser Ala Aso His 


His Val Glu 


185 




190 


Phe 


Ara Tm Asn Tm 


He lie Val 




205 




Arcf 
Atg 


Asn Asn Glv Gin 
AO XX wxjr wxxx 


Leu Leu Gl v 
xjci* xjcu wxy 




220 




fva 
wya 


Tl A Ala PHa Gin 

xxc ^^XA «XXC wXXX 


Glu 'Thr TtAU 

\JXU XXXX XICU 




235 


240 


nc l« 


Tlii^ Qay" rtl n rSl n 
XXIX OCX \3XU UXIX 


Ai^rr rtl n Av^rr 
nxg v7XxX Axg 




250 


255 


Gin 


Gin Sp*r Thr Al a 

wXXX OCX XXXX fU.A 


Aro Val Val 

J*1X^ VAX VAX 


265 




270 


Leu 


Tvr His Phe Phe 

jrxAG fXAw 


Asn Glu' Val 




285 




Val 


Tm He Ala Ser 


Glu Ser Tm 

X^k/ 




300 




Leu 


Thr Glu Leu Glv 


His Leu Glv 

AA^f9 AJwu \j^y 




315 


320 


Ser 


Va 1 Pro T 1 A Pro 
V n JL Jrxw xxc JT^XW 


Glv PhA Qat* 

V7Xy JCrXXC OCX 




330 


335 


Ala 


Gl V Pm Pro Pm 
vjxy fxv JTxw JT^xv 


TiAii QAr Arcr 
XJCLX OCX xu>^ 


345 




350 


Gin 


Glu Cvs Asn Asn 


r'vs TiAU Asn 




365 




Leu 


Ara Leu Ser Glv 


Glu Ara Val 

wX U AX ^ V AX 




380 




Tvr 

xy X 


Ala Val Ala His 


Ala. Leu His 




395 


400 


Thr 


Cvs Thr Lvs Ara 


Val Val Tvr 

VAX VAX. xyx 




410 


415 


Tm 


Lvfi Val Asn Ph^ 

XJJr B V AX ABXX f XXC 


Thr Tieu Tieu 

XXXX XJCU XICIX 


425 




430 


Gl n 

wXXl 


Gl V A an Va 1 ■ A 1 a 
V7Xy ABp VAX xxXa 


XicU IlXB XiCU 




445 




Arci 


Ser Gin Asn Pro 


Phe Gin Ser 




460 




Arg 


Gin Leu Lys Asn 


He Gin Asp 




475 


480 


Thr 


He Pro Met Ser 


Met Cys Ser 




490 


495 


Lys 


Lys Pro Val Gly 


He His Val 
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500 










505 






510 


Cys Cys Phe 


Glu 


Cys 


Xle^ Asp 


Cys 


Leu 


Pro Gly 


Thr Phe 


Leu Asn His 


515 










520 






525 




Thr Glu Asp 


Glu 


Tyr 


Glu 


Cys 


Gin Ala 


Cys Pro 


Asn Asn 


Glu Trp Ser 


530 








535 








540 




Tyr Gin Ser 


Glu 


Thr 


Ser 


Cys 


Phe Lys 


Arg Gin 


Leu Val 


Phe Leu Glu 


545 






550 








555 




560 


His Glu Val 


Pro 


Thr 


He 


Val 


Val 


Ala 


He Leu 


Ala Ala 


Leu Gly Phe 






565 










570 




575 


Phe Ser Thr 


Leu 


Ala 


He 


Leu 


Phe 


He 


Phe Trp 


Arg His 


Phe Gin Thr 




580 










585 






590 


Pro Met Val 


Arcr 


Ser 


Ala 


Gly 


Gly Pro 


Met Cys 


Phe Leu 


Met Leu Val 


595 










600 






605 




Pro Leu Leu 


Leu 


Ala 


Phe 


Gly 


Met 


Val 


Pro Val 


Tyr Val 


Gly Pro Pro 


610 








615 








620 




Thr Val Phe 


Ser 


Cvs 


Phe 


Cys 


Arg Gin 


Ala Phe 


Phe Thr 


Val Cys Phe 


625 






630 








635 




640 


Ser He Cys 


Leu 


Ser 


Cvs 


He 


Thr Val 


Arg Ser 


Phe Gin 


He Val Cys 






645 










650 




655 


Val Phe Lvs 


Met 


Ala 


Arcr 


Ara 


Leu 


Pro 


Ser Ala 


Tyr Ser 


Phe Tim Met 




660 










665 






670 


Arcr Tvr His 


Glv 


Pro 


Tvr 


Val 


Phe 


Val 


Ala Phe 


He Thr 


Ala He Lvs 


675 










680 






685 




Val Ala Leu 


Val 


Val 


Gly 


Asn 


Met 


Leu 


Ala Thr 


Thr He 


Asn Pro Tie 


690 








695 








700 




Gly Arg Thr 


Asp 


Pro 


Asp 


Asp 


Pro 


Asn 


He Met 


He Leu 


Ser Cys His 


705 






710 








715 




720 


Pro Asn Tyr 


Ara 


Asn 


Glv 


Leu 


Leu 


Phe 


Asn Thr 


Ser Met 


Asp Leu Leu 






725 










730 




735 


Leu Ser Val 


Leu 


Glv 


Phe 


Ser 


Phe 


Ala 


Tyr Met 


Glv Lvs 


Glu Leu Pro 




740 










745 






750 


Thr Asn Tvr 


Asn 


Glu 


Ala 


Lvs 


Phe 


He 


Thr Leu 


Ser Met 


Thr Phe Ser 


755 










760 






765 




Phe Thr Ser 


Ser 


He 


Ser 


Leu 


Cys 


Thr 


Phe Met 


Ser Val 


His Asp Gly 


770 








775 








780 




Val Leu Val 


Thr 


He 


Met 


Asp 


Leu 


Leu 


Val Thr 


Val Leu 


Asn Phe Leu 


785 






790 








795 




800 


Ala He Gly 


Leu 


Gly 


Tyr 


Phe 


Gly Pro 


Lys Cys 


Tyr Met 


He Leu Phe 






805 










810 




815 


Tyr Pro Glu 


Arg 


Asn 


Thr 


Ser 


Ala Tyr 


Phe Asn 


Ser Met 


He Gin Gly 




820 










825 






830 


Tyr Thr Met 


Arg 


Lys 


Ser 















835 



<210> 23 
<211> 843 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; note a 
synthetic construct 

<400> 23 

Met Gly Pro Gin Ala Arg Thr Leu Cys Leu Leu Ser Leu Leu Leu His 

1 5 10 15 

Val Leu Pro Lys Pro Gly Lys Leu Val Glu Asn Ser Asp Phe His Leu 
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20 



Ala 


Gly 


ASD 

****** 


Tvr 


Leu 


Leu 


Gly 


Gly 






35 










40 


Lys 


Ser 


lie 


Ser 


His 


Leu 


Ser 


Tvr 




50 










55 




Phe 


Thr 


Met 


Lvs 


Val 


Leu 


Gly 


Tyr 


65 










70 






Ala 


Val 


Glu 


Glu 


lie 


Asn 


Asn 


Cvs 










85 








Leu 




xyj. 


Glu 

V7X IX 


Met 


Val 


Asp 


Val 








1 no 

X U V 










Xr<U w 




JJCU 




Phe 

JTXXC 


Leti 

XJGU 


Ala 

AX A 


Gin 
















190 

Xa U 


uy D 




lyr 


Ca V 
OC3x 


G1 n 


iyr 




PT"rt 
xrxw 




JL J U 










1 "^5 
X JO 






2V cm 
no XX 


OCX 


Gl 11 

wXU 


Cat* 

OCX 


Ala 

AX A 


Tie 
xxc 


Thr 












150 

X9 V 








lie 




Gin 

wXXX 


Tie 

xxc 


Thr 


xyx 


OCX 










XwO 








Ly s 




His 


Phe 


Pro 


Ser 


Met 


Leu 








180 










His 


lie 


Glu 


Ala 


Met 


Val 


Gin 


Leu 






X 










200 




VAX 


VAX 


TiAII 
XJCU 


Val 

V AX 


OCX 


Asp 


Asp 














215 
A xo 




4JCU 


TiAii 

XJCiX 


0CX 


Qln 

wXXX 


«xy 


Leu 


llir 


Lvs 

XI o 












910 






wXXX 


VTXiX 


VAX 


TiA11 




He 


Pro 


Glu 




















Gin 




Gin 


Leu 


Asp 


Asn 


He 








PRO 










Ala 




Val 


Val 


Val 


Val 


Phe 


Ser 






Z / 3 










9 ft 0 




fix o 


\jXiX 


Val 

V AX 


TiAn 

XICU 


AT'Cf 

«xy 


■Lx-p 


Asn 




•? on 










9 Q5 




Cay* 


wXU' 


Cat* 

wCX 


xrp 


Ala 

AX A 


He 


Asp 


Pro 












"^1 0 

J X v 






Axg 


His 


xxxx 


Glv 


Thr 


Phe 


Leu 


Glv 


















PSTO 




irxxc 


Cat* 

OCX 


Gin 

VJXXX 


Phe 

IrXXC 




Val 

VAX 








J U 










Val 


Pro 


Asn 


Thr 


Thr. 


Asn 


Leu 


Aro 
















360 


Ala 


Cys 


Ii€U 


Asn 


Thr 


Thr 


Lys 


Ser 




J / v 










375 






Glu 




Val 


Val 


xyx 


Ser 


Val 


•3 OS 










390 






His 


Ala 


Leu 


His 




Leu 


Leu 


Glv 










405 








Gin 


Lya 


Val 


Tyr 


Pro 


Trp 


Gin 


Leu 








420 










Phe 


Thr 


Leu 


Leu 


Gly 


Asn 


Arg 


Leu 






435 










440 


Pro 


Met 


Leu 


Leu 


Asp 


He 


He 


Gin 




450 










455 




Pro 


Phe 


Gin 


Ser 


He 


Ala 


Ser 


Tyr 


465 










470 







25 










30 






Leu 


Phe 


Thr 


Leu 


His 
45 


Ala 


Asn 


Val 


Leu 


Gin 


Val 


Pro 
60 


Lvs 




Asn 


Glu 


Asn 


Leu 


Met 
75 


Gin 

wXXX 


Ala 

AXA 


Met- 


Axg 


Phe 

IT xxc 

80 


Ser 


Ser 

90 


Leu 


Leu 


Pro 


Glv 


Val 

VAX 

95 


Leu 


v^yo 




Leu 


Ser 


Asn 


Asn 


He 


His 


105 










110 






& an 
nay 


nop 




T>A11 
JJCU 


Leu 

1 95 
Xa9 


Pro 


T 1 A 

XXc 


Leu 


nxS 


VAX 


VaX 


AX A 

140 


Val 
VaX 


Tl A 

xxe 


vaxy 


Pro 


vax 


Ser 


Asn 
155 


Tl A 

XXc 


Leu 


Ser 


XT'! a 

nxs 


Dh A 

160 


Hi a 
AXa 


Tl A 

xxc 
170 


Cat> 

OCX 


Asp 


Lys 


Leu 


Arg 
175 


Asp 


& T"rt 


XXXX' 


Val 

VAX 


pT"rt 
irXU 


Cat* 

OCX 


Al a 

AXA 


XXXX 


nxo 


185 










190 








Val 

VAX 


His 

fix 19 


PhA 


Gl n 

V7XXX 

205 




Acn 

AOXX 


xxp 


Asp 


iyr 


vjrxy 


Arg 

220 


Gl « 
uXU 


Aoxx 


C Al^ 
OCX 


■nxo 


XXXX 


Ser 


Asp 
235 


Tl A 

XXc 


Cys 


Tl A 

XXc 


Al a 
AXA 


PVia 
It xxc 

240 


Oat* 
OCX 


Ser 
250 


V7XXX 


17a 1 
VAX 


ncb 


Arg 


Ser 
255 


rsl 11 
uxu 


Leu 


Asp 


Lys 


Leu 


Arg 


Arg 


xhx 


Cat* 
OCX 


265 










270 






cxu 


Gin 

UXIX 


Leu 


Cer 

OCX 


XJCvX 

285 


iyr 


OCX 


Phe 

irXXC 


DT-i A 


THt* 

xxix 


nl \r 
v3xy 


Ph A 

300 


Val 

VAX 


_ 

Trp 


Tl A 
XXC5 


Al a 
AXA 


VAX 


Leu 


TT-I 0 
IIXB 

315 


Asn 


Leu 


Xxlx 


Gl 11 


Leu 
320 


VAX 


lllx 

330 


Tl A 

xxe 


uxn 


Arg 


Val 
VAX 


Ser 
335 


Tl A 

XXc 


Ar^ 


Arg 


Asp 


Lys 


Pro 


nl V 
vjxy 


Tyr 


PT*rk 
xrxw 


345 










350 






XXXX 


Thr* 

XXXX; 


\^y B 


AoXX 


G1 n 
wxxx 

365 


AB^ 


wye 


A ST^ 
Ao^ 


Phe 

IT xxc 


Ann 

AD XX 


Ann 

AOXX 


Tie 
xxc 

380 


TtAll 
JUCvX 


Tie 
xxc 


XJCU 


OCX 


iyr 


Ser 


Ala 
395 


Val 


xyx 


Ala 


Val 

V AX 


Ala 
400 


uys 


A an 

AOXX 

410 


i*xg 


Val 

VAX 




Ova 
v*y D 


Thr* 

XXXX 

415 


xjy D 


Leu 


Arg 


- wtXU 


Tl A 

xxe 


Trp 


nxS 


VaX 


Asn 


425 










430 






Phe 


Phe 


Asp 


Gin 


Gin 
445 


Gly 


Asp 


Met 


Trp 


Gin 


Trp 


Asp 
460 


Leu 


Ser 


Gin 


Asn 


Ser 


Pro 


Thr 
475 


Ser 


Lys 


A3?g 


Leu 


Thr 
480 
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Tyr lie Aen Asn Val Ser Trp Tyr Thr Pro Asn Asn Thr Val Pro Val 

485 490 495 

Ser Met Cys Ser Lys Ser Cys Gin Pro Gly Gin Met Lys Lys Ser Val 

500 505 510 

Gly Leu His Pro Cys Cys Phe Glu Cys Leu Asp Cys Met Pro Gly Thr 

515 520 525 

Tyr Leu Asn Arg Ser Ala Asp Glu Phe Asn Cys Leu Ser Cys Pro Gly 

530 535 540 

Ser Met Trp Ser Tyr Lys Asn Asp lie Thr Cys Phe Gin Arg Arg Pro 
545 550 555 560 

Thr Phe Leu Glu Trp His Glu Ala Pro Thr lie Ala Val Ala Leu Leu 

565 570 575 

Ala Ala Leu Gly Phe Leu Ser Thr Leu Ala lie Leu Val lie Phe Trp 

580 585 590 

Arg His Phe Gin Thr Pro lie Val Arg Ser Ala Gly Gly Pro Met Cys 

595 600 605 

Phe Leu Met Leu Thr Leu Leu Leu Val Ala Tyr Met Val Val Pro Val 

610 615 620 

Tyr Val Gly Pro Pro Lys Val Ser Thr Cys Leu Cys Arg Gin Ala Leu 
625 630 635 640 

Phe Pro Leu Cys Phe Thr lie Cys He Ser Cys He Ala Val Arg Ser 

645 650 655 

Phe Gin He Val Cys Ala Phe Lys Met Ala Ser Arg Phe Pro Arg Ala 

660 665 670 

Tyr Ser Tyr Trp Val Arg Tyr Gin Gly Pro Tyr Val Ser Met Ala Phe 

675 680 685 

He Thr Val Leu Lys Met Val He Val Val He Gly Met Leu Ala Thr 

690 695 700 

Gly Leu Ser Pro Thr Thr Arg Thr Asp Pro Asp Asp Pro Lys He Thr 
705 710 715 720 

He Val Ser Cys Asn Pro Asn Tyr Arg Asn Ser Leu Leu Phe Asn Thr 

725 730 735 

Ser Leu Asp Leu Leu Leu Ser Val Val Gly Phe Ser Phe Ala Tyr Met 

740 745 750 

Gly Lys Glu Leu Pro Thr Asn Tyr Asn Glu Ala Lys Phe He Thr Leu 

755 760 765 

Ser Met Thr Phe Tyr Phe Thr Ser Ser Val Ser Leu Cys Thr Phe Met 

770 775 780 

Ser Ala Tyr Ser Gly Val Leii Val Thr He Val Asp Leu Leu Val Thr 
785 790 795 800 

Val Leu Asn Leu Leu Ala He Ser Leu Gly Tyr Phe Gly Pro Lys Cys 

805 810 815 

Tyr Met He Leu Phe Tyr Pro Glu Arg Asn Thr Pro Ala Tyr Phe Asn 

820 825 830 

Ser Met He Gin Gly Tyr Thr Met Arg Arg Asp 



<211> 853 
<212> PRT 

<213> Artificial Seq[uence 
<220> 

<223> Description of Artificial Sequence; note » 
synthetic construct 

<400> 24 



635 



840 



<210> 



24 



33 
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Met 


Leu 


Gly 


Pro 


1 












Thy 












Gly 


Asp 


Tyr 


Val 






35 




Ala 


Glv 


Leu 


Aira 

rtxy 










irxic 


o C i. 


Cat* 
OCX 


Asn 


65 








Glu 




Tie 


Asn 






XICU 










100 


Ser 


Leu 


Met 


Phe 






XX9 




Cys 


Asn 


Tyr 


xnr 




1 "an 






IlXS 


SenT 


Ser 


V7XU 










Leu 


JXlcu 


Pro 






vjIU 


xnr 


irne 








180 


Gin 


Leu 


Thr 


Ala 










vaj. 


Axa 


AJ.a 


Leu 




210 






ixe 


Phe 


Ser 


Axa 










\9±Y 


Leu 


Val 


Pro 


m n 

oJ.Il 


Asp 


Val 


Ti*5>n 
UCU. 








260 


Leu 


Phe Ala 


Ser 






275 




sesT 


Ser Arg 


Leu 




290 






Txiir 


Ser Asp 


Leu 










vai 


Leu Gly 


Fne 


Tyr 


Val 


Lys 


XIlx 








340 


Ala 


Leu Gly 


Glu 






355 




Arg 


Cys 


Pro 


Gin 




370 






Leu 


Asn His 


UxS 


385 








Ala 


Gin Ala 


Leu 


Ala 


Gin Asp 


Pro 








420 


Leu 


Thr 


Phe 


His 



Ala 


Val 


Leu 


Gly 


5 








Ala 


Pro 


Leu 


Cvs 


Leu 


Glv 


Glv 


Leu 








40 


Ser 


Arg 


Thr 


Arg 






55 




m V 

VJrx 


XICU 


Leu 






70 






Asn 


Lys 


Ser 


Asp 


85 










Thr 


Cys 


Ser 


XICLL 


Ala 


Lys 


Ala 








120 


Gin 


Tyr 


Gin 


Pro 






X J9 




Leu 


2Vl a 
Axel 


Pits u 


V ClX 




150 






Va.± 


Ser 


Tyr 


vjxy 


xQ«> 








Pro 


Ser 






AJ.a 


&1 a 

Aj.a 












200 


Gly 


Ser 


Asp 


Asp 




'71 K 




Leu 


ai a 
Axa 


&1 a 


al a 










Leu 


IrxU 


Arg 


Axa 


245 








His 
XIX o 


Gin 


Val 


Asn 


Val 


His 


Ala 


Ala 








280 


Ser 


Pro 


Lys 


Val 






295 




VelJ. 


Mot* 








310 






Leu 


oxh 


Arg 


wJty 










XIX S 




nxa 


XICU 


Arg 






wxy 








360 


Cys 


Asp 


Cys 


lie 






375 




Gin 


Thr 


Phe 


Ser 




390 






His 


Asn 


Thr 


Leu 


405 








Val 


Lys 


Pro 


Trp 


Val 


Gly 


Gly 


Leu 



Leu 


Ser 


Leu 


Trp 




10 






Leu 


Ser 


Gin 


Gin 










Phe 


Pro 


Xic: u 


Gl V 

\jx jr 


Pro 


Ser 


OCX 


Pro 










Ala 


Leu 


Ala 


Met 






75 




Leu 


Leu 


Pro 


Glv 
ox/ 




90 








IrX w 


Val 
vax 


Val 
vax 


105 








rslv 

vvxy 


Roy 

OCX 


IVt'ct 
Airg 




Arg 


Val 
Vax 


Leu 


211 a 
Axa 








140 


Thr 


Gly 


Lys 


Phe 






155 




a 1 a 
AX a 


OCX 




VjXVL 




170 






Arg 


XXix 


Val 


JrxO 


X09 








Leu 


u±n 


UxU 


irne 


UXU 


Tyr 


uxy 


Arg 








220 


Arg 


Gly 


He 


Cys 










Asp 


Asp 


Ser 


Arg 




250 






VJlJ.Il 


Ser 


Ser 


Val 
Vax 


265 








TT-i n 

IIXD 


a1 a 

AXCL 


UC LL 


JtrXXC 


xrp 


Val 


Al a 
AXd 


OCX 








300 


Pro 


Gly 


Met 


Ala 










Axa 


oXIi 


Leu 


AXB 




^ "5 n 

O J V 






AX a 


Thr 


Asp 


Pro 


•5 A K 








Leu 


wXU 


ril 11 

oXU 


Asp 


Thr 


Leu 


Gin 


Asn 








380 


Val 


Tvr 


Ala 


Ala 






395 




Qln 


Cys 


Asn 


Ala 




410 






Gin 


Leu 


Leu 


Glu 


425 








Pro 


Leu 


Arg 


Phe 



34 



Ala 


Leu 


Leu 


His 






15 




Leu 


Atq 


Met 


Lys 




30 






Glu 


Ala 


Glu 


Glu 


45 








Val 


Cys 


Thr 


Arg 


Lys 


Met 


Ala 


Val 








80 


Leu 


Arg 


Leu 


Gly 






95 




Ala 


Met 


Lvs 


Pro 




110 






He 


Ala 


Ala 


Tvr 


125 








Val 


He 


Glv 


Pro 


Phe 


Ser 


Phe 


Phe 








160 


Leu 


Leu 


Ser 


Ala 






175 




Ser 


Asp 


Axy 


Val 




190 






Qlv 
oxy 


Tim 
xx^ 


Asn 


Tm 

XXJ^ 


205 








Qln 


Glv 


Leu 


OCX 


He 


Ala 


His 


Glu 








240 


Leu 


Gly 


Lys 


Val 






255 




Gin 


Val 


Val 


Leu 




270 






Asn 


Tvr 


Ser 


He 


285 








Glu 


Ala 


Tro 


Leu 


Gin 


Met 


Glv 


Thr 








320 


Glu 


Phe 


Pro 


Gin 






335 




Ala 


Phe 


Cys 


Ser 




350 






Val 


Val 


Qlv 


Gin 


365 








Val 


Ser 


Ala 


Gly 


Val 


Tyr 


Ser 


Val 








400 


Ser 


Gly 


Cys 


Pro 






415 




Asn 


Met 


Tyr 


Asn 




430 






Asp 


Ser 


Ser 


Gly 
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435 


440 


Asn Val 


Asp 


Met Glu Tvr Asn Leu 


450 




455 


Val Pro 


Arcr 


Leu His Asp Val Gly 


465 




470 


Olu Ar^ 


Leu 


IjVs Tie Arcr Tm His 






485 




Cys 


Cat* Aifcr QTn Cvs Glii 






500 


T XT' "D Vi o 
vjxy JtrliC 




Q»T~ rS/o rK/ci TH/f 2icm 
OCX v_jfa o -^y^ 






520 






ASH JrXO XXC 


W u 




535 




Ser 


±r£^0 l9XU Axg OCX XUX 






9 9 w 




wXU 


XX^ v7Xy wXU JrXV AXCL 






565 




VaX 


TtOll fi^ T«A11 'PVll'' TkAll 

xicsu V3xy jjcu xiix xitsu 






sou 




Asp 


Gat* T.A11 Va 1 rtl n 
OCX JrXU XICU 'VdX V7XX1 






O VJ V.I . 


XT IXC *jj.y 


Leu 


Tl A rH/o TiAii f3l \r TiAII 
xxc v^y D jucu V3xy jjcu 


610 




615 


Pjto Gly 


Arg 


IrXO Axg OCX AXa OCX 










Fro 


xicu JLXix uxy wyo xicu 






645 


Gill Tlf» 




Va 1 nl 11 fi^t* GTu Ij^u 

V CL J. VSaLU UCU 






660 


C*\ra Coy* 




T.Aii liira Glv Pt"0 Tna 




675 


680 




Val 


Gill Ala Al A lieu C?va 


690 




695 




VcLX 


TTa 1 T'ViT" Acm "Pttv rtl ti 

vax iiix nsp ixp OXXl 


/ UO 




71 0 
/ XU 


xiis vjys 


Arg 


Nec Arg oer irp vax 










vax 


Xieu AX a iruG Xicu wys 






7Aft 


SeiT Gin 


Pro 


^1*1 w TVwv ^H,Fv> TVavi Rwv 

uxy Axg lyx abh Axg 




/ 3 9 


760 


- 

XiGU AJ.a 


lyx 


Phe Tie Tl f» Tr-n Val 
Jrjuc xxc xxc voix 


770 




775 


VclX ^xu 


VcLX 


Ala TH/T^ GT ti DT*rt &1 a 
AXCt \3^U XrXW Axn 






790 


TV n 4 

cys ivia 


Leu 


vjxy X X c lie u AX a. i uzr 






805 


Leu Leu 


Trp 


Leu Pro Glu Leu Asn 






820 


Ser Pro 


Lys 


Glu Ala Ser Asp Gly 




835 


840 


Arg Gly 


His 


Ser Glu 


850 







<210> 25 
<211> 857 
<212> PRT 









445 








Iivs Leu 




Val 


Trp 


Gin 


Glv 


Ser 






460 










Arg Ph.e 


Asn 


Gly Ser 


Leu 


Aro 

Axy 


Thr 




475 










480 


Tlir Ser 


ASD 


Asn 


Gin 


Lvs 


Pro 


Val 


490 










495 




Glu Gly 


Gin 


Val Arg 


Arcr 


Val 


Lys ' 


505 








510 






fVs VaT 




Cys 


Glu 


Ala 


Gly 


Ser 








525 








AX CI. v»y Ef 


Thy 

XXLX 


Phe Cys 


Gly 


Gin 








540 










TV Otro 
Axy *-yB 


irXlC 


Arg Arg 


Ai^g 


OCX 


Arg 




55Ck 










5f!n 


Val T.A11 
vax JJCU 


Ser 


Leu 


Leu 


XICU 


TiAll 
XiCU 


Leu 


R70 










575 




ai a a1 a 

AXa AXCL 


TaAll 

XICU 


Gly Leu 


f^XXC 


Val 

VCLX 


nxo 










590 






Ala j^e-r 
AX a OCX 


Glv 


Gly Ser 


Leu 


Phe 


Vi»y D 








605 








iriic v^y o 


XJC u 


Ser 


Val 


T.AII 
XICU 


- 

jjeu 


PVlA 
XT IXC 






620 










Vi*y (3 xic u 


AXCL 


Gin Gin 


DT»rv 
JrX \J 


flC L* 


Al a 

AXCL 














640 


Q AT^ 

OCX XJULx 


XICU 


Phe 


Leu 


Gl n 


Al a 

AX^ 


lil a 
AXcl 


650 










655 




OT"rt TtAll 

tr \J XICU 


Cat" 

OCX 


Trp Ala 


AO 11 


ixp 


TiAll 
XICU 


665 








670 






Ala' Trp 


Leu 


Val 


Val 


Leu 


Leu 


Ala 








685 








Ala Trp 


Tvr 


Leu 


Met 


Ala 


Phe 


Pro 






700 










Va 1 Tif»ii 

VCLX XICU 


It XV./ . 


Thr 


Glu 


Val 

VCLX 


XICU 


Gl 11 

V7X U 




/ XO 












Ser Leu 


wxy 


Leu 


Val 


AXS 


Tl A 

xxe 


inr 


7m 










1 SO 




JrllC XICU 


GItt 

V7xy 


Thr 


Phe 


Leu 


Val 

vax 


uxn 


745 








750 
/ o w 






TV 1 a TV V'^i 

AX a Axg 


ni 
vjxy 


Leu 


Thr 


DViA 

irne 


&1 a 

Axa 


MaI> 

pieu 








765 








filAV* DllA 

OCX iruxs . 


Val 

VCLX 


Pro 


Leu 


JJC u 


Ala 


AOU 






780 










Val Gin 


Met 


Gly Ala 


He 


Leu 


Phe 




795 










800 


Phe His 


Leu 


Pro Lys 


Cys 


Tyr 


Val 


810 










815 




Thr Gin 


Glu 


Phe 


Phe 


Leu 


Gly 


Arg 


825 








830 






Asn Ser 


Gly 


Ser 


Ser 


Glu 


Ala 


Thr 



845 



35 
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<213> Artificial Sequence 
<220> 

<223> Description of Artificial Secpience; note = 
synthetic construct 



<400> 25 



Met 


Pro 


Gly 


Leu 


Ala 


lie 


Leu 


Gly 


Leu 


Ser 


Leu Ala Ala Phe Leu Glu 


1 








5 










10 




15 


Leu 


Gly 


Met 


Gly 


Ser 


Ser 


Leu 


Cys 


Leu 


Ser 


Gin Gin Phe Lys Ala Gin 








20 










25 






30 


Glv 


Asp 


Tvr 


lie 


Leu 


Glv 


Glv 


Leu 


Phe 


Pro 


Leu Gly Thr Thr Glu Glu 






35 










40 








45 


Ala 


Thr 


Leu 


Asn 


Gin 


Arg 


Thr 


Gin 


Pro 


Asn 


Gly He Leu Cys Thr Arg 




50 










55 










60 


Phe 


Ser 


Pro 


Leu 


Gly 


Leu 


Phe 


Leu 


Ala 


Met 


Ala Met Lys Met Ala Val 


65 










70 










75 


80 


Glu 


Glu 


lie 


Asn 


Asn 


Glv 


Ser 


Ala 


Leu 


Leu 


Pro Gly Leu Arg Leu Gly 










85 










90 




95 


Tyr 


Asp 


Leu 


Phe 


Asp 


Thr 


Cvs 


Ser 


Glu 


Pro 


Val 


Val Thr Met Lys Pro 








100 










105 






110 


Ser 


Leu 


Met 


Phe 


Met 


Ala 


Lvs 


Val 


Gly 


Ser 


Gin 


Ser He Ala Ala Tyr 






115 










120 








125 


Cvs 


Asn 


Tvr 


Thr 


Gin 


Tvr 


Gin 


Pro 


Arq 


Val 


Leu 


Ala Val He Gly Pro 




130 










135 










140 


iais 


Ser 


Ser 


Glu 


Leu 


Ala 


Leu 


He 


Thr 


Gly 




Phe Phe Ser Phe Phe 


145 










150 










155 


160 


lieu 


Met 


Pro 


Gin 


Val 


Ser 


Tvr 


Ser 


Ala 


Ser 


Met. Asp Arg Leu Ser Asp 










165 










170 




175 


Aro 


Glu 


Thr 


Phe 


Pro 


Ser 


Phe 


Phe 


Ara 


Thr 


Val 


Pro Ser Asp Arg Val 








180 










185 






.190 


Gin 


Leu 


Gin 


Ala 


Val 


Val 


Thr 


Leu 


Leu 


Gin 


Asn 


Phe Ser Trp Asn Trp 






195 










200 








205 


Val 


Ala 


Ala 


Leu 


Glv 


Ser 


Asp 


Asp 


Asp 


Tvr 

X jr 4. 


Gly 


Arg Glu Gly Leu Ser 




210 










215 










220 


lie 


Phe 


Ser 


Glv 


Leu 


Ala 


Asn 


Ser 


Atq 


Glv 


He 


Cys He Ala His Glu 


225 










230 










235 


240 


Qly 


Leu 


Val 


Pro 


Gin 


His 


Asp 


Thr 


Ser 


Gly 


Gin 


Gin Leu Gly Lys Val 










.245 










250 




255 


Val 


Asp 


Val 


Leu 


Arg 


Gin 


Val 


Asn 


Gin 


Ser 


Lys 


Val Gin Val Val Val 








260 










265 






270 


Leu 


Phe 


Ala 


Ser 


Ala 


Arcr 


Ala 


Val 


Tyr 


Ser 


Leu 


Phe Ser Tyr Ser He 






275 










280 








285 


Leu 


His 


Asp 


Leu 


Ser 


Pro 


Lys 


Val 


Trp 


Val 


Ala 


Ser Glu Ser Trp Leu 




290 










295 










300 


Thr 


Ser 


Asp 


Leu 


Val 


Met 


Thr 


Leu 


Pro 


Asn 


He 


Ala Arg Val Gly Thr 


305 










310 










315 


320 


Val 


Leu 


Gly 


Phe 


Leu 


Gin 


Arg 


Gly 


Ala 


Leu 


Leu 


Pro Glu Phe Ser His 










325 










330 




335 


Tyr 


Val 


Glu 


Thr 


Arg 


Leu 


Ala 


Leu 


Ala 


Ala 


Asp 


Pro Thr Phe Cys Ala 








340 










345 






350 


Ser 


Leu 


Lys 


Ala 


Glu 


Leu 


Asp 


Leu 


Glu 


Glu 


Arg 


Val Met Gly Pro Arg 






355 










360 








365 


Cys 


Ser 


Gin 


Cys 


Asp 


Tyr 


lie 


Met 


Leu 


Gin 


Asn 


Leu Ser Ser Gly Leu 




370 










375 










380 


Met 


Gin 


Asn 


Leu 


Ser 


Ala 


Gly 


Gin 


Leu 


His 


His 


Gin He Phe Ala Thr 


385 










390 










395 


400 
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Tyr Ala 


Ala 


Val 


Tyr 


Ser 


Val 


Ala 








405 








Cys Asn 


Val 


Ser 


His 


Cys 


His 


Thr 






420 










Leu Leu 


Glu 


Asn 


Met 


Tyr 


Asn 


Met 




435 










440 


Leu Qln 


Phe 


Asp 


Ala 


Lys 


Gly 


Ser 


450 










455 




Met Trp 


Val 


Trp 


Gin 


Ser 


Pro 


Thr 


465 








470 






Phe Asn 


Gly 


Thr 


Leu 


Gin 


Leu 


Gin 








485 








Asn Gin 


Val 


Pro 


Val 


Ser 


Gin 


Cys 






500 










Val Arg 


Arg 


Val 


Lys 


Gly 


Phe 


His 




515 










520 


Cys Lys 


Ala 


Gly 


Ser 


Tyr 


Arg 


Lys 


530 










535 




Pro Cys 


Gly 


Lys 


Asp 


Gin. 


Trp 


Ser 


545 








550 






Pro Arg 


Arg 


Pro 


Lys 


Phe 


Leu 


Glu 








565 








Leu Leu 


Leu 


Leu 


Leu 


Ser 


Leu 


Ala 






580 










Gly Leu 


Phe 


Val 


His 


His 


Arg 


Asp 




595 










600 


Gly Pro 


Leu 


Ala 


Cys 


Phe 


Gly 


Leu 


610 










615 




Ser Val 


Leu 


Leu 


Phe 


Pro 


Gly 


Gin 


625 








630 






Gin Gin 


Pro 


Leu 


Ser 


His 


Leu 


Pro 








645 








Phe Leu 


Gin 


Ala 


Ala 


Glu 


lie 


Phe 






660 










Trp Ala 


Asp 


Arcr 


Leu 


Ser 


Gly 


Cys 




SIS 










680 


Val Val 


Leu 


Leu 


Ala 


Met 


Leu 


Val 


690 










695 




Leu Val 


Ala 


Phe 


Pro 


Pro 


Glu 


Val 


705 








710 






Thr Glu 


Ala 


Leu 


Val 


His 


Cys 


Arg 








725 








Leu Ala 


His 


Ala 


Thr 


Asn 


Ala 


Thr 






740 










Thr Phe 


Leu 


Val 


Arg 


Ser 


Gin 


Pro 




755 










760 


Leu Thr 


Phe 


Ala 


Met 


Leu 


Ala 


Tyr 


770 










775 




Pro Leu 


Leu 


Ala 


Asn 


Val 


Gin 


Val 


785 








790 






Gly Ala 


Leu 


Leu 


Leu 


Cys 


Val 


Leu 








805 








Pro Arg 


Cys 


Tyr 


Leu 


Leu 


Met 


Arg 






820 










Phe Phe 


Leu 


Gly 


Gly 


Gly 


Pro 


Gly 




835 










840 


Asn Thr 


Gly 


Asn 


Gin 


Gly 


Lys 


His 



Qln 


Ala 


Leu 


His Asn 


Thr 


Leu 


Gin 




410 








415 




Ser 


Glu 


Pro 


Val Gin 


Pro 


Trp 


Gin 


425 








430 






Ser 


Phe 


Arg 


Ala Arg 


Asp 


Leu 


Thr 








445 








Val 


Asp 


Met 


Glu Tyr 


Asp 


Leu 


Lys 








460 








Pro 


Val 


Leu 


His Thr 


Val 


Gly 


Thr 






475 








480 


His 


Ser 


Lys 


Met Tyr 


Trp 


Pro 


Gly 




490 








495 




Ser 


Arg 


Gin 


Cys Lys 


Asp 


Gly 


Gin 


505 








510 






Ser 


Cys 


Cys 


Tyr Asp 


Cys 


Val 


Asp 








525 








His 


Pro 


Asp 


Asp Phe 


Thr 


Cys 


Thr 








540 








Pro 


Glu 


Lys 


Ser Thr 


Thr 


Cys 


Leu 






555 








560 


Trp 


Gly 


Glu 


Pro Ala 


Val 


Leu 


Leu 




570 








575 




Leu 


Gly 


Leu 


Val Leu 


Ala 


Ala 


Leu 


585 








590 






Ser 


Pro 


Leu 


Val Gin 


Ala 


Ser 


Gly 








605 








Val 


Cys 


Leu 


Gly Leu 


Val 


Cys 


Leu 








620 








Pro 


Ser 


Pro 


Ala Arg 


Cys 


Leu 


Ala 






635 








640 


Leu 


Thr 


Gly 


Cys Leu 


Ser 


Thr 


Leu 




650 








655 




Val 


Glu 


Ser 


Glu Leu 


Pro 


Leu 


Ser 


665 








670 






Leu 


Arg 


Gly 


Pro Trp 


Ala 


Trp 


Leu 








685 








Glu 


Val 


Ala 


Leu Cys 


Thr 


Trp 


Tyr 








700 








Val 


Thr 


Asp 


Trp His 


Met 


Leu 


Pro 






715 








720 


Thr 


Arg 


Ser 


Trp Val 


Ser 


Phe 


Gly 




730 








735 




Leu 


Ala 


Phe 


Leu Cys 


Phe 


Leu 


Gly 


745 








750 






Gly 


Arg 


Tyr 


Asn Arg 


Ala 


Arg 


Gly 








765 








Phe 


He 


Thr 


Trp Val 


Ser 


Phe 


Val 








780 








Val 


Leu 


Arg 


Pro Ala 


Val 


Gin 


Met 






795 








800 


Gly 


He 


Leu 


Ala Ala 


Phe 


His 


Leu 




810 








815 




Gin 


Pro 


Gly 


Leu Asn 


Thr 


Pro 


Glu 


825 








830 






Asp 


Ala 


Gin 


Gly Gin 


Asn 


Asp 


Gly 



845 



Glu 
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850 855 

<210> 26 
<211> 840 
<212> PRT 

<213> Artificial Sec[uence 
<220> 

<223> Description of Artificial Sequence; note « 
synthetic constzruct 



<400> 26 






















Met 


Gly 


Pro 


Arg 


Ala 


Lys 


Thr 


He Cys 


Ser 


Leu 


Phe 


Phe Leu Leu Trp 


1 








5 










10 






15 


val 


Leu 


Ala 


Glu 


Pro 


Ala 


Glu 


Asn 


Ser 


Asp 


Phe 


Tyr 


Leu Pro Gly Asp 








20 










25 








30 


Tyr 


Leu 


Leu 


Gly 


Gly 


Leu 


Phe 


Ser 


Leu 


His 


Ala 


Asn 


Met Lys Gly He 






35 










40 










45 


Val 


His 


Leu 


Asn 


Phe 


Leu 


Gin 


Val 


Pro 


Met 


Cys 


Lys 


Glu Tyr Glu Val 




50 










55 










60 




Lys 


Val 


He 


Gly 


Tyr 


Asn 


Leu 


Met 


Gin 


Ala 


Met 


Arg 


Phe Ala Val Glu 


65 










70 










75 




80 


Glu 


He 


Asn 


Asn 


Asp 


Ser 


Ser 


Leu 


Leu 


Pro 


Gly 


Val 


Leu Leu C31y Tyr 










85 










90 






95 


Glu 


He 


Val 


Asp 


Val 


Cys 


Tyr 


He 


Ser 


Asn 


Asn 


Val 


Gin Pro Val Leu 








100 










105 








110 


Tyr 


Phe 


Leu 


Ala 


His 


Glu 


Asp 


Asn 


Leu 


Leu 


Pro 


He 


Gin Glu Asp Tyr 






115 










120 










125 


Ser 


Asn 


Tyr 


He 


Ser 


Arg 


Val 


Val 


Ala 


Val 


He 


Gly 


Pro Asp Asn Ser 




130 










135 










140 




Glu 


Ser 


Val 


Met 


Thr 


Val 


Ala 


Asn 


Phe 


Leu 


Ser 


Leu 


Phe Leu Leu Pro 


145 










150 










155 




160 


Gin 


He 


Thr 


Tyr 


Ser 


Ala 


He 


Ser Asp 


Glu 


Leu 


Arg 


Asp Lys Val Arg 










165 










170 






175 


Phe 


Pro 


Ala 


Leu 


Leu 


Arg 


Thr 


Thr 


Pro 


Ser 


Ala 


Asp 


His His Val Glu 








180 










185 








190 


Ala 


Met 


Val 


Gin 


Leu 


Met 


Leu 


His 


Phe 


Arg 


Trp 


Asn 


Trp He He Val 






195 










200 










205 


Leu 


Val 


Ser 


Ser 


Asp 


thr 


Tyr 


Gly Arg 


Asp 


Asn 


Gly 


Gin Leu Leu Gly 




210 










215 










220 




Glu 


Arg 


Val 


Ala 


Arg 


Arg 


Asp 


He Cys 


He 


Ala 


Phe 


Gin Glu Thr Leu 


225 










230 










235 




240 


Pro 


Thr 


Leu 


Gin 


Pro 


Asn 


Gin 


Asn 


Met 


Thr 


Ser 


Glu 


Glu Arg Gin Arg 










245 










250 






255 


Leu 


Val 


Thr 


He 


Val 


Asp 


Lys 


Leu 


Gin 


Gin 


Ser 


Thr 


Ala Arg Val Val 








260 










265 








270 


Val 


Val 


Phe 


Ser 


Pro 


Asp 


Leu 


Thr 


Leu 


Tyr 


His 


Phe 


Phe Asn Glu Val 






275 










280 










285 


Leu 


Arg 


Gin 


Asn 


Phe 


Thr 


Gly 


Ala 


Val 


Trp 


He 


Ala 


Ser Glu Ser Trp 




290 










295 










300 




Ala 


He 


Asp 


Pro 


Val 


Leu 


His 


Asn 


Leu 


Thr 


Glu 


Leu 


Gly His Leu Gly 


305 










310 










315 




320 


Thr 


Phe 


Leu 


Gly 


He 


Thr 


He 


Gin 


Ser 


val 


Pro 


He 


Pro Gly Phe Ser 










325 










330 






335 


Glu 


Phe 


Arg 


Glu 


Trp 


Gly 


Pro 


Gin Ala 


Gly 


Pro 


Pro 


Pro Leu Ser Arg 








340 










345 








350 


Thr 


Ser 


Gin 


Ser 


Tyr 


Thr 


Cys 


Asn 


Gin 


Glu 


Cys 


Asp 


Asn Cys Leu Asn 
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355 360 365 

Ala Thr Leu Ser Phe Asn Thr lie Leu Arg Leu Ser Gly Glu Arg Val 

370 375 380 

Val Tyr Ser Val Tyr Ser Ala Val Tyr Ala Val Ala His Ala Leu His 
385 390 395 400 

Ser Leu Leu Gly Cys Asp Lys Ser Thr Cys Thr Lys Arg Val Val Tyr 

405 410 415 

Pro Trp Gin Leu Leu Glu Glu lie Trp Lys Val Asn Phe Thr Leu Leu 

420 425 430 

Asp His Gin lie Phe Phe Asp Pro Gin Gly Asp Val Ala Leu His Leu 

435 440 445 

Glu lie Val Gin Trp Gin Trp Asp Arg Ser Gin Asn Pro Phe Gin Ser 
450 . 455 460 

Val Ala Ser Tyr Tyr Pro Leu Gin Arg Gin Leu Lys Asn lie Gin Asp 

465 470 475 480 

lie Ser Trp His Thr Val Asn Asn Thr lie Pro Met Ser Met Cys Ser 

485 490 495 

Lys Arg Cys Gin Ser Gly Gin Lys Lys Lys Pro Val Gly lie His Val 

500 505 510 

Cys Cys Phe Glu Cys lie Asp Cys Leu Pro Gly Thr Phe Leu Asn His 

515 520 525 

Thr Glu Asp Glu Tyr Glu Cys Gin Ala Cys Pro Asn Asn Glu Trp Ser 

530 535 540 

Tyr Gin Ser Glu Thr Ser Cys Phe Lys Arg Gin Leu Val Phe Leu Glu 
545 550 555 560 

Leu Arg Glu His Thr Ser Tzp Val Leu Leu Ala Ala Asn Thr Leu Leu 

565 570 575 

Leu Leu Leu Leu Leu Gly Thr Ala Gly Leu Phe Ala Trp His Leu Asp 

580 585 590 

Thr Pro Val Val Arg Ser Ala Gly Gly Arg Leu Cys Phe Leu Met Leu 

595 600 605 

Gly Ser Leu Ala Ala Gly Ser Gly Ser Leu Tyr Gly Phe Phe Gly Glu 

610 615 620 

Pro Thr Arg Pro Ala Cys Leu Leu Arg Gin Ala Leu Phe Ala Leu Gly 
625 630 635 640 

Phe Thr lie Phe Leu Ser Cys Leu Thr Val Arg Ser Phe Gin Leu lie 

645 650 655 

He He Phe Lys Phe Ser Thr Lys Val Pro Thr Phe Tyr His Ala Trp 

660 665 670 

Val Gin Asn His Gly Ala Gly Leu Phe Val Met He Ser Ser Ala Ala 

675 680 685 

Gin Leu Leu He Cys Leu Thr Trp Leu Val Val Trp Thr Pro Leu Pro 

690 695 700 

Ala Arg Glu Tyr Gin Arg Phe Pro His Leu Val Met Leu Glu Cys Thr 
705 710 715 720 

Glu Thr Asn Ser Leu Gly Phe He Leu Ala Phe Leu Tyr Asn Gly Leu 

725 730 735 

Leu Ser He Ser Ala Phe Ala Cys Ser Tyr Leu Gly Lys Asp Leu Pro 

740 745 750 

Glu Asn Tyr Asn Glu Ala Lys Cys Val Thr Phe Ser Leu Leu Phe Asn 

755 760 765 

Phe Val Ser Trp He Ala Phe Phe Thr Thr Ala Ser Val Tyr Asp Gly 

770 775 780 

Lys Tyr Leu Pro Ala Ala Asn Met Met Ala Gly Leu Ser Ser Leu Ser 
785 790 795 800 
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Ser Gly Phe Gly Gly Tyr Phe Leu Pro Lys Cys Tyr Val lie Leu Cys 

805 810 . 815 

Arg Pro Asp Leu Asn Ser Thr Glu His Phe Gin Ala Ser . lie Gin Asp 

820 825 830 

Tyr Thr Arg Arg Cys Gly Ser Thr 
835 840 

<210> 27 
<211> 840 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; note « 
synthetic construct 



<400> 27 












Met 


Leu 


Leu 


Cys 


Thr Ala 


Arg Leu Val 


Gly Leu Gin Leu Leu He Ser 


1. 








5 






10 15 


Cys 


Cys 


Trp 


Ala 


Phe 


Ala 


Cys His Ser 


Thr Glu Ser Ser Pro Asn Phe 








20 






.25 


30 


Thr 


Leu 


Pro 


Glv 


Asp 


Tyr 


Leu Leu Ala 


Glv Len Phe Pro Leu Hie SeT* 






35 








40 


45 


Gly 


Cys 


Leu 


Gin 


Val Arg 


His Ara Pro 


Glu Val ThT Leu Cvs Aen Ara 




50 










55 


60 


Ser 


Cys 


Ser 


Phe 


Asn 


Glu 


His Gly Tyr 


His Leu Phe Gin Ala Met Arg 


65 










70 




75 80 


Leu 


Gly 


Val 


Glu 


Glu 


He 


Asn Asn Ser* 


Thr Ala' Leu Leu Pro Asn He 










85 






90 . . 95 


Thr 


Leu 


Gly 


Tvr 


Gin 


Leu 


Tvr Asn Val 


• Cvs Ser Asn Ser Ala Asn Val 








100 






105 


110 


Tyr 


Ala 


Thr 


Leu 


Arg Val 


Leu Ser Leu 


Pro Gly Gin His His He Glu 






115 








120 


125 


Leu 


Gin 


Gly 


Asp 


Leu 


Leu 


His Tyr Ser 


Pro Thr Val Leu Ala Val He 




130 










135 


140 


Gly 


Pro 


Asp 


Ser 


Thr 


Asn 


Arg. Ala Ala 


Thr Thr Ala Ala Leu Leu Ser 


145 










150 




155 160 


Pro 


Phe 


Leu 


Val 


Pro 


Met 


He Ser Tyr 


Ala Ala Ser Ser Glu Thr Leu 










165 






170 175 


Ser 


Val 


Lys 


Arg 


Gin 


Tyr 


Pro Ser Phe 


Leu Arg Thr He Pro Asn Asp 








180 






185 


190 


Lys 


Tyr 


Gin 


Val 


Glu 


Thr 


Met Val Leu 


Leu Leu Gin Lys Phe Gly Trp 






195 








200 


205 


Thr 


Trp 


lie 


Ser 


Leu 


Val 


Gly Ser Ser 


Asp Asp Tyr Gly Gin Leu Gly 




210 










215 


220 


Val 


Gin 


Ala 


Leu 


Glu 


Asn 


Gin Ala Thr 


Gly Gin Gly He Cys He Ala 


225 










230 




235 240 


Phe 


Lys 


Asp 


He 


Met 


Pro 


Phe Ser Ala 


Gin Val Gly Asp Glu Arg Met 










245 






250 255 


Gin 


Cys 


Leu 


Met 


Arg 


His 


Leu Ala Gin 


Ala Gly Ala Thr Val Val Val 








260 






265 


270 


Val 


Phe 


Ser 


Ser 


Arg 


Gin 


Leu Ala Arg 


Val Phe Phe Glu Ser Val Val 






275 








280 


285 


Leu 


Thr 


Asn 


Leu 


Thr 


Gly 


Lys Val Trp 


Val Ala Ser Glu Ala Trp Ala 




290 










295 


300 


Leu 


Ser Arg 


His 


He 


Thr 


Gly Val Pro 


Gly He Gin Arg He Gly Met 


305 










310 




315 320 
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Val Leu 


Gly 


Val 


Ala 


He 


Gin 


Lys 


Arg 


Ala 


Val 


Pro Gly Leu Lys Ala 








325 










330 




335 


Phe Glu 


Glu 


Ala 


Tyr 


Ala 


Arg 


Ala 


Asp 


Lys 


Lys 


Ala Pro Arg Pro Cys 






340 










345 






350 


His Lys 


Gly 


Ser 


Trp 


Cys 


Ser 


Ser 


Asn 


Gin 


Leti 


Cys Arg Glu Cys Gin 




355 










360 








365 


Ala Phe 


Met 


Ala 


His 


Thr 


Met 


Pro 


Lys 


Leu 


Lys 


Ala Phe Ser Met Ser 


370 










375 










380 


Ser Ala 


Tyr 


Asn 


Ala 


Tyr 


Arg 


Ala 


Val 


Tyr 


Ala 


Val Ala His Gly Leu 


385 








390 










395 


400 


His Qln 


Leu 


Leu 


Gly 


Cys 


Ala 


Ser 


Gly 


Ala 


Cys 


Ser Arg Gly Arg Val 








405 










410 




415 


Tyr Pro 


Trp 


Gin 


Leu 


Leu 


Glu 


Gin 


He 


His 


Lys 


Val His Phe Leu Leu 






420 










425 






430 


His Lys 


Asp 


Thr 


Val 


Ala 


Phe 


Asn 


Asp 


Asn 


Arg 


Asp Pro Leu Ser Ser 




435 










440 








445 


Tyr Asn 


He 


He 


Ala 


Trp 


Asp 


Trp 


Asn 


Gly 


Pro 


Lys Trp Thr Phe Thr 


450 










455 










460 


Val Leu 


Gly 


Ser 


Ser 


Thr 


Trp 


Ser 


Pro 


Val 


Gin 


Leu Asn He Asn Glu 


465 








470 










475 


480 


Thr Lys 


He 


Gin 


Trp 


His 


Gly 


Lys 


Asp 


Asn 


Gin 


Val Pro Lys Ser Val 








485 










490 




495 


Cys Ser 


Ser 


Asp 


Cys 


Leu 


Glu 


Gly 


His 


Qln 


Arg 


Val Val Thr Gly Phe 






500 










505 






510 


His His 


Cys 


Cys 


Phe 


Glu 


Cys 


Val 


Pro 


Cys 


Gly 


Ala Gly Thr Phe Leu 




515 








520 








525 


Asn Lys 


Ser 


Asp 


Leu 


Tyr 


Arg 


Cys 


Gin 


Pro 


Cys 


Gly Lys Glu Glu Trp 


530 










535 










540 


Ala Pro 


Glu 


Gly 


Ser 


Gin 


Thr 


Cys 


Phe 


Pro 


Arg 


Thr Val Val Phe Leu 


545 








550 










555 


560 


Glu Trp 


His 


Glu 


Ala 


Pro 


Thr 


He 


Ala 


Val 


Ala 


Leu Leu Ala Ala Leu 








565 










570 




575 


Gly Phe 


Leu 


Ser 


Thr 


Leu 


Ala 


He 


Leu 


Val 


He 


Phe Trp Arg His Phe 






580 










585 






590 


Gin Thr 


Pro 


He 


Val 


Arg 


Ser 


Ala 


Gly 


Gly 


Pro 


Met Cys Phe Leu Met 




595 










600 








605 


Leu Thr 


Leu 


Leu 


Leu 


Val 


Ala 


Tyr 


Met 


Val 


Val 


Pro Val Tyr Val Gly 


610 










615 










620 


Pro Pro 


Lys 


val 


Ser 


Thr 


Cys 


Leu 


Cys 


Arg 


Gin 


Ala Leu Phe Pro Leu 


625 








630 










635 


640 


Cys Phe 


Thr 


He 


Cys 


He 


Ser 


Cys 


He 


Ala 


Val 


Arg Ser Phe Gin He 








645 










650 




655 


Val Cys 


Ala 


Phe 


Lys 


Met 


Ala 


Ser 


Arg 


Phe 


Pro 


Arg Ala Tyr Ser Tyr 






660 










665 






670 


Trp Val 


Arg 


Tyr 


Gin 


Gly 


Pro 


Tyr 


Val 


Ser 


Met 


Ala Phe He Thr Val 




675 










680 








685 


Leu Lys 


Met 


Val 


He 


Val 


Val 


He 


Gly 


Met 


Leu 


Ala Thr Gly Leu Ser 


690 










695 










700 


Pro Thr 


Thr 


Arg 


Thr 


Asp 


Pro 


Asp 


Asp 


Pro 


Lys 


He . Thr He Val Ser 


705 








710 










715 


720 


Cys Asn 


Pro 


Asn 


Tyr 


Arg 


Asn 


Ser 


Leu 


Leu 


Phe 


Asn Thr Ser Leu Asp 








725 










730 




735 


Leu Leu 


Leu 


Ser 


Val 


Val 


Gly 


Phe 


Ser 


Phe 


Ala 


Tyr Met Gly Lys Glu 






740 










745 






750 


Leu Pro 


Thr 


Asn 


Tyr 


Asn 


Glu 


Ala 


Lys 


Phe 


He 


Thr Leu Ser Met Thr 




755 










760 








765 


Phe Tyr 


Phe 


Thr 


Ser 


Ser 


Val 


Ber 


Leu 


Cys 


Thr 


Phe Met Ser Ala Tyr 
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770 775 780 

Ser Gly Val Leu Val Thr lie Val Asp Leu Leu Val Thr Val Leu Asn 
785 790 795 800 

Leu Leu Ala lie Ser Leu Gly Tyr Phe Gly Pro Lys Cys Tyr Met He 

805 810 815 

Leu Phe Tyr Pro Glu Arg Asn Thr Pro Ala Tyr Phe Asn Ser Met He 

820 825 830 

Gin Gly Tyr Thr Met Arg Arg Asp 
835 .840 

<210> 28 
<211> 1123 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; note = 
synthetic constzruct 

<400> 28 

Leu Gin Val Arg His Arg Pro Glu Val Thr Leu Cys Asp Arg Ser Cys 

1 5 10 15 

Ser Phe Asn Glu His Gly Tyr His Leu Phe Gin Ala Met Arg Leu Gly 

20 25 30 

Val Glu Glu He Asn Asn Ser Thr Ala Leu Leu Pro Asn He Thr Leu 

35 40 45 

Gly Tyr Gin Leu Tyr Asp Val Cys Ser Asp Ser Ala Asn Val Tyr Ala 

50 55 60 

Thr Leu Arg Val Leu Ser Leu Pro Gly Gin His His He Glu Leu Gin 
65 70 75 80 

Gly Asp Leu Leu His Tyr Ser Pro Thr Val Leu Ala Val He Gly Pro 

85 90 95 

Asp Ser Thr Asn Arg Ala Ala Thr Thr Ala Ala Leu Leu Ser Pro Phe 

. 100 105 110 

Leu Val Pro Met He Ser Tyr Ala . Ala Ser Ser Glu Thr Leu Ser Val 

115 . 120 125 

Lys Arg Gin Tyr Pro Ser Phe Leu Arg Thr He Pro Asn Asp Lys Tyr 

130 135 . 140 

Gin Val Glu Thr Met Val Leu Leu Leu Gin Lys Phe Gly Trp Thr Trp 
145 150 155 160 

He Ser Leu Val Gly Ser Ser Asp Asp Tyr Gly Gin Leu Gly Val Gin 

165 170 175 

Ala Leu Glu Asn Gin Ala Thr Gly Gin Gly He Cys He Ala Phe Lys 

180 185 190 

Asp He Met Pro Phe Ser Ala Gin Val Gly Asp Glu Arg Met Gin Cys 

195 200 205 

Leu Met Arg His Leu Ala Gin Ala Gly Ala Thr Val Val Val Val Phe 

210 215 220 

Ser Ser Arg Gin Leu Ala Arg Val Phe Phe Glu Ser Val Val Leu Thr 
225 230 235 240 

Asn Leu Thr Gly Lys Val Trp Val Ala Ser Glu Ala Trp Ala Leu Ser 

. 245 .250 255 

Arg His He Thr Gly Val Pro Gly He Gin Arg He Gly Met Val Leu 

260 265 270 

Gly Val Ala He Gin Lys Arg Ala Val Pro Gly Leu Lys Ala Phe Glu 

275 280 285 

Glu Ala Tyr Ala Arg Ala Asp Lys Lys Ala Pro Arg Pro Cys His Lys 
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290 295 300 

Gly Ser Trp Cys Ser Ser Asn Gin Leu Cys Arg Glu Cfys Gin Ala Phe 
305 310 315 320 

Met Ala His Thr Met Pro Lys Leu Lys Ala Phe Ser Met Ser Ser Ala 

325 330 335 

Tyr Asn Ala Tyr Arg Ala Val Tyr Ala Val Ala His Gly Leu His Gin 

340 345 350 

Leu Leu Gly Cys Ala Ser Gly Ala Cys Ser Arg Gly Arg Val Tyr Pro 

355 360 365 

Trp Gin Leu Leu Glu Gin He His Lys Val His Phe Leu Leu His Lys 

370 375. 380 

Asp Thr Val Ala Phe Asn Asp Asn Arg Asp Pro Leu Ser Ser Tyr Asn 
385 390 395 400 

He lie Ala Trp Asp Trp Asn Gly Pro Lys Trp Thr Phe Thr Val Leu 

405 410 415 

Gly Ser Ser Thr Trp Ser Pro Val Gin Leu Asn He Asn Glu Thr Lys 
420 425 430 

He Gin Trp His Gly Lys. Asp Asn Gin Val Pro Lys Ser Val Cys Ser 

435 440 445 

Ser Asp Cys Leu Glu Gly His Gin Arg Val Val Thr Gly Phe His His 

450 455 460 

Cys Cys Phe Glu Cys Val Pro Cys Gly Ala Gly Thr Phe Leu Asn Lys 
465 470 475 480 

Ser Asp Leu Tyr Arg Cys Gin Pro Cys Gly Lys Glu Glu Trp Ala Pro 

485 490 495 

Glu Gly Ser Gin Thr Cys Phe Pro Arg Thr Val Val Phe Leu Glu Trp 

500 505 510 

Ser Asp He Glu Ser He He Ala He Ala Phe Ser Cys Leu Gly He 

515 520 525 

Leu Val Thr Leu Phe Val Thr Leu He Phe Val Leu Tyr Arg Asp Thr 

530 535 540 

Pro Val Val Lys Ser Ser Ser Arg Glu Leu Cys Tyr He He Leu Ala 
545 550 555 560 

Gly He Phe Leu Gly Tyr Val Cys Pro Phe Thr Leu He Ala Lys Pro . 

565 570 575 

Thr Thr Thr Ser Cys Tyr Leu Gin Arg Leu Leu Val Gly Leu Ser Ser 

580 585 590 

Ala Met Cys Tyr Ser Ala Leu Val , Thr Lys Thr Asn Arg He Ala Arg ' 

595 600 605 

He Leu Ala Gly Ser Lys Lys Lys He Cys Thr Arg Lys Pro Arg Phe 

610 615 620 

Met Ser Ala Trp Ala Gin Val He He Ala Ser He Leu He Ser Val 
625 630 635 640 

Gin Leu Thr Leu Val Val Thr Leu He He Met Glu Pro Pro Met Pro 

645 650 655 

He Leu Ser Tyr Pro Ser He Lys Glu Val Tyr Leu He Cys Asn Thr 

660 665 670 

Ser Asn Leu Gly Val Val Ala Pro Val Gly Tyr Asn Gly Leu Leu He 

675 680 685 

Met Ser Cys Thr Tyr Tyr Ala Phe Lys Thr Arg Asn Val Pro Ala Asn 

690 695 700 

Phe Asn Glu Ala Lys Tyr lie Ala Phe Thr Met Tyr Thr Thr Cys lie 
705 710 715 720 

He Trp Leu Ala Phe Val Pro He Tyr Phe Gly Ser Asn Tyr Lys He 

725 730 735 

He Thr Thr Cys Phe Ala Val Ser Leu Ser Val Thr Val Ala Leu Gly 
740 745 750 
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Cys Met Phe Thr Pro Lys Met Tyr lie lie He Ala Lys Pro Glu Arg 

755 760 765 

Asn Val Arg Ser Ala Phe Thr Thr Ser Asp Val Val Arg Met His Val 

770 775 780 

Gly Asp Gly Lys Leu Pro Cys Arg Ser Asn Thr Phe Leu Asn He Phe 
785 790 795 800 

Arg Arg Lys Lys Pro Gly Ala Gly Asn Ala Asn Ser Asn Gly Lys Ser 

805 810 815 

Val Ser Trp Ser Glu Pro Gly Gly Arg Gin Ala Pro Lys Gly Gin His 

820 825 830 

Val Trp Gin Arg Leu Ser Val His Val Lys Thr Asn Glu Thr Ala Cys 

835 840 845 

Asn Gin Thr Ala Val He Lys Pro Leu Thr Lys Ser Tyr Gin Gly Ser 

850 855 860 

Gly Lys Ser Leu Thr Phe Ser Asp Ala Ser Thr Lys Thr Leu Tyr Asn 
865 870 875 880 

Val Glu Glu Glu Asp Asn Thr Pro Ser Ala His Phe Ser Pro Pro Ser 

885 890 895 

Ser Pro Ser Met Val Val His Arg Arg Gly Pro Pro Val Ala Thr Thr 

900 905 910 

Pro Pro Leu Pro Pro His Leu Thr Ala Glu Glu Thr Pro Leu Phe Leu 

915 920 925 

Ala Asp Ser Val He Pro Lys Gly Leu Pro Pro Pro Leu Pro Gin Gin 

930 935 940 

Gin Pro Gin Gin Pro Pro Pro Gin Gin Pro Pro Gin Gin Pro Lys Ser 
945 950 955 960 

Leu Met Asp Gin Leu Gin Gly Val Val Thr Asn Phe Gly Ser Gly He 

965 970 975 

Pro Asp Phe His Ala Val Leu Ala Gly Pro Gly Thr Pro Gly Asn Ser 

980 985 990 

Leu Arg Ser Leu Tyr Pro Pro Pro Pro Pro Pro Gin His Leu Gin Met 

995 1000 1005 

Leu Pro Leu His Leu Ser Thr Phe Gin Glu Glu Ser He Ser Pro Pro 

1010 1015 1020 

Gly Glu Asp He Asp Asp Asp Ser Glu Arg Phe Lys Leu Leu Gin Glu 
1025 1030 1035 1040 

Phe Val Tyr Glu Arg Glu Gly Asn Thr Glu Glu Asp Glu Leu Glu Glu 

1045 1050 1055 

Glu Glu Asp Leu Pro Thr Ala Ser Lys Leu Thr Pro Glu Asp Ser Pro 

1060 1065 1070 

Ala Leu Thr Pro Pro Ser Pro Phe Arg Asp Ser Val Ala Ser Gly Ser 

1075 1080 1085 

Ser Val Pro Ser Ser Pro Val Ser Glu Ser Val Leu Cys Thr Pro Pro 

1090 1095 1100 

Asn Val Thr Tyr Ala Ser Val He Leu Arg Asp Tyr Lys Gin Ser Ser 
1105 1110 1115 1120 

Ser Thr Leu 



<210> 29 
<211> 1172 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; note » 
synthetic construct 
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<400> 29 

Met Gly Pro Arg Ala Lys Thr He Cys Ser Leu Phe Phe Leu Leu Trp 

1 5 10 15 

Val Leu Ala Glu Pro Ala Glu Asn Ser Asp Phe Tyr Leu Pro Gly Asp 

20 25 30 

Tyr Leu Leu Gly Gly Leu Phe Ser Leu His Ala Asn Met Lys Gly He 

35 40 45 

Val His Leu Asri Phe Leu Gin Val Pro Met Cys Lys Glu Tyr Glu Val 

50 55 60 

Lys Val He Gly Tyr Asn Leu. Met Gin Ala Met Arg Phe Ala Val Glu 
65 70 75 80 

Glu He Asn Asn Asp Ser Ser Leu Leu Pro Gly Val Leu Leu Gly Tyr 

85 90 95 

Glu He Val Asp Val Cys Tyr . He Ser Asn Asn Val Gin Pro Val Leu 

100 105 110 

Tyr Phe Leu Ala His Glu Asp Asn Leu Leu Pro He Gin Glu Asp Tyr 

115 120 125 

Ser Asn Tyr He Ser Arg Val Val Ala Val He Gly Pro Asp Asn Ser 

130 135 140 

Glu Ser Val Met Thr Val Ala Asn Phe Leu Ser Leu Phe Leu Leu Pro 
145 150 155 160 

Gin He Thr Tyr Ser Ala He Ser Asp Glu Leu Arg Asp Lys Val Arg 

165 170 175 

Phe Pro Ala Leu Leu Arg Thr Thr Pro Ser Ala Asp His His Val Glu 

180 185 190 

Ala Met Val Gin Leu Met Leu His Phe Arg Trp Asn Trp He He Val 

195 200 205 

Leu Val Ser Ser Asp Thr Tyr Gly Arg Asp Asn Gly Gin Leu Leu Gly ' 

210 215 220 

Glu Arg Val Ala Arg Arg Asp He Cys He Ala Phe Gin Glu Thr Leu 
225 230 235 240 

Pro Thr Leu Gin Pro Asn Gin Asn Met Thr Ser Glu Glu Arg Gin Arg 

245 250 255 

Leu Val Thr He Val Asp Lys Leu Gin Gin Ser Thr Ala Arg Val Val 

260 265 270 

Val Val Phe Ser Pro Asp Leu Thr Leu Tyr His Phe Phe Asn Glu Val 

275 280 285 

Leu Arg Gin Asn Phe Thr Gly Ala Val Trp He Ala Ser Glu Ser Trp 

290 295 300 

Ala He Asp Pro Val Leu His Asn Leu Thr Glu Leu Gly His Leu Gly 
305 310 315 320 

Thr Phe Leu Gly He Thr He Gin Ser Val Pro He Pro Gly Phe Ser 

325 330 335 

Glu Phe Arg Glu Trp Gly Pro Gin Ala Gly Pro Pro Pro Leu Ser Arg 

340 345 350 

Thr Ser Gin Ser Tyr Thr Cys Asn Gin Glu Cys Asp Asn Cys Leu Asn 

355 360 365 

Ala Thr Leu Ser Phe Asn Thr He Leu Arg Leu Ser Gly Glu Arg Val 

370 375 380 

Val Tyr Ser Val Tyr Ser Ala Val Tyr Ala Val Ala His Ala Leu His 
385 390 395 400 

Ser Leu Leu Gly Cys Asp Lys Ser Thr Cys Thr Lys Arg Val Val Tyr 

405 410 415 

Pro Trp Gin Leu Leu Glu Glu He Trp Lys Val Asn Phe Thr Leu Leu 

420 425 430 

Asp His Gin He Phe Phe Asp Pro Gin Gly Asp Val Ala Leu His Leu 
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435 440 445 

Glu He Val Gin Trp Gin Trp Asp Arg Ser Gin Asn Pro Phe Gin Ser 

450 455 460 

Val Ala Ser Tyr Tyr Pro Leu Gin Arg Gin Leu Lys Asn He Gin Asp 
465 470 475 480 

lie Ser Trp His Thr Val Asn Asn Thr He Pro Met Ser Met Cys Ser 

485 490 . 495 

Lys Arg Cys Gin Ser Gly .Gin Lys Lys Lys Pro Val Gly He His Val 

500 505 510 

Cys Cys Phe Glu Cys He Asp Cys Leu Pro Gly Thr Phe Leu Asn His 

515 520 525 

Thr Glu Asp Glu Tyr Glu Cys Gin Ala Cys Pro Asn Asn Glu Trp Ser 

530 535 540 

Tyr Gin Ser Glu Thr Ser Cys Phe Lys Arg Gin Leu Val Phe Leu Glu 
545 550 555 560 

Trp Ser Asp He Glu Ser He He Ala He Ala Phe Ser Cys Leu Gly 

565 570 575 

He Leu Val Thr Leu Phe Val Thr Leu He Phe Val Leu Tyr Arg Asp 

580 585 590 

Thr Pro Val Val Lys Ser Ser Ser Arg Glu Leu Cys Tyr He He Leu 

595 600 605 

Ala Gly He Phe Leu Gly Tyr Val Cys Pro Phe Thr Leu He Ala Lys 

610 615 620 

Pro Thr Thr Thr Ser Cys Tyr Leu Gin ArQ Leu Leu Val Gly Leu Ser 
625 630 635 640 

Ser Ala Met Cys Tyr Ser Ala Leu Val Thr Lys Thr Asn Arg He Ala 

645 650 > 655 

Arg He Leu Ala Gly Ser Lys Lys Lys He Cys Thr Arg Lys Pro Arg 

660 665 670 

Phe Met Ser Ala Trp Ala Gin Val He He Ala Ser He Leu He Ser 

675 680 685 

Val Gin Leu Thr Leu Val Val Thr Leu He He Met Glu Pro Pro Met 

690 695 700 

Pro He Leu Ser Tyr Pro Ser He Lys Glu Val Tyr Leu lie Cys Asn 
705 710 715 720 

Thr Ser . Asn Leu Gly Val Val Ala Pro Val Gly Tyr Asn Gly Leu Leu 

725 730 735 

He Met Ser Cys Thr Tyr Tyr Ala Phe Lys Thr Arg Asn Val Pro Ala 

740 745 750 

Asn Phe Asn Glu Ala Lys Tyr He Ala Phe Thr Met Tyr Thr Thr Cys 

755 760 765 

He He Trp Leu Ala Phe Val Pro He Tyr Phe Gly Ser Asn Tyr Lys 

770 775 780 

He He Thr Thr Cys Phe Ala Val Ser Leu Ser Val Thr Val Ala Leu 
785 790 795 800 

Gly Cys Met Phe Thr Pro Lys Met Tyr He He He Ala Lys Pro Glu 

805 810 815 

Arg Asn Val Arg Ser Ala Phe Thr Thr Ser Asp Val Val Arg Met His 

820 825 830 

Val Gly Asp Gly Lys Leu Pro Cys Arg Ser Asn Thr Phe Leu Asn He 

835 840 845 

Phe Arg Arg Lys Lys Pro Gly Ala Gly Asn Ala Asn Ser Asn Gly Lys 

850 855 860 

Ser Val Ser Trp Ser Glu Pro Gly Gly Arg Gin Ala Pro Lys Gly Gin 
865 870 875 880 

His Val Trp Gin Arg Leu Ser Val His Val Lys Thr Asn Glu Thr Ala 
885 890 895 



46 



wo 2005/015158 



PCT/US2004/025459 



Cys 


Asn 


Gin 


Thr 
900 


Ala 


Val 


He 


Lys 


Pro 
905 


Leu 


Thr 


Lys 


Ser 


Tyr 
910 


Gin 


Gly 


Ser 


Gly 


Lys 

915 


Ser 


Leu 


Thr 


Phe 


Ser 
920 


Asp 


Ala 


Ser 


Thr 


Lys 
925 


Thr 


Leu 


Tyr 


Asn 


Val 
930 


Glu 


Glu 


Glu 


Asp 


Asn 

935 


Thr 


Pro 


Ser 


Ala 


His 
940 


Phe 


Ser 


Pro 


Pro 


Ser 


Ser 


Pro 


Ser 


Met 


Val 


Val 


His 


Arg 


Airg 


Gly 


Pro 


Pro 


Val 


Ala 


Thr 


945 










950 










955 










960 


Thr 


Pro 


Pro 


Leu 


Pro 
965 


Pro 


His 


Leu 


Thr 


Ala 
970 


Glu 


Glu 


Thr 


Pro 


Leu 
975 


Phe 


Leu 


Ala 


Asp 


Ser 
980 


val 


He 


Pro 


Lys 


Gly 
985 


Leu 


Pro 


Pro 


Pro 


Leu 
990 


Pro 


Gin 


Gin 


Gin 


Pro 


Gin 


Gin 


Pro 


Pro 


Pro 


Gin 


Gin 


Pro 


Pro 


Gin 


Gin 


Pro 


Lys 






995 










1000 








1005 






Ser 


Leu 


Met 


Asp 


Gin 


Leu 


Gin 


Gly Val 


Val 


Thr 


Asn 


Phe Gly 


Ser 


Gly 




1010 








1015 








1020 








He 


Pro Asp 


Phe 


His 


Ala 


Val 


Leu 


Ala 


Gly 


Pro 


Gly Thr Pro 


Gly 


Asn 


1025 








1030 








1035 








1040 


Ser Leu Arg 


Ser 


Leu 


Tyr 


Pro 


Pro 


Pro 


Pro 


Pro 


Pro 


Gin 


His 


Leu 


Gin 










1045 








1050 








1055 


Met 


Leu 


Pro 


Leu 


His 


Leu 


Ser 


Thr 


Phe 


Gin 


Glu 


Glu 


Ser 


He 


Ser 


Pro 



1060 1065 1070 

Pro Gly Glu Asp He Asp Asp Asp Ser Glu Arg Phe Lys Leu Leu Gin 
1075 1080 1085 

Glu Phe Val Tyr Glu Arg Glu Gly Ash Thr Glu Glu Asp Glu Leu Glu 

1090 1095 1100 

Glu Glu Glu Asp Leu Pro Thr Ala Ser Lys Leu Thr Pro Glu Asp Ser 
1105 1110 1115 1120 

Pro Ala Leu Thr Pro Pro Ser Pro Phe Arg Asp Ser Val Ala Ser Gly 

1125 1130 1135 

Ser Ser Val Pro Ser Ser. Pro Val Ser Glu Ser Val Leu Cys Thr Pro 

1140. 1145 1150 

Pro Asn Val Thr Tyr Ala Ser Val He Leu Arg Asp Tyr Lys Gin Ser 

1155 . 1160 1165 

Ser Ser Thr Leu 
1170 

<210> 30 
<211> 1175 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; note = 
synthetic construct 

<400> 30 



Met Leu Gly Pro 


Ala Val Leu 


Gly 


Leu 


Ser 


Leu 


Trp Ala 


Leu 


Leu His 


1 


5 






10 








15 


Pro Gly Thr Gly 


Ala Pro Leu 


Cys 


Leu 


Ser 


Gin 


Gin Leu 


Arg 


Met Lys 


20 






25 








30 




Gly Asp Tyr Val 


Leu Gly Gly 


Leu 


Phe 


Pro 


Leu 


Gly Glu 


Ala 


Glu Glu 


35 




40 








45 






Ala Gly Leu Arg 


Ser Arg Thr 


Arg 


Pro 


Ser 


Ser 


Pro Val 


Cys 


Thr Arg 


50 


55 










60 






Phe Ser Ser Asn 


Gly Leu Leu 


Trp 


Ala 


Leu 


Ala 


Met Lys 


Met 


Ala Val 


65 


70 








75 






80 
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Glu Glu 


lie 


Asn 


Tyr Asp 


Leu 


Phe 






100 


Ser Leu 


Mer 


Fne 




115 




Cys Asn 


Tyr 


Thr 


130 






Hla Ser 


Ser 


Glu 


145 






Leu Met 


Pro 


Gin 


Arg Glu 


Thr 


Phe 






180 


Gin Leu 


Thr 


Ala 




195 




Val Ala 


Ala 


Leu 


210 






lie Phe 


Ser 


Ala 


225 






Gly Leu 


Val 


Pro 


Gin Asp 


Val 


Leu 






260 


Leu Fne 


aisl 


Ser 




275 




Ser Ser 


Arg 


Leu 


290 






Thr Ser 


Asp 


Leu 


305 






val Leu 


Gly 


Phe 


Tyr Val 


Lys 


Thr 






340 


Ala Leu 


Gly 


Glu 




355 




Arg Cys 


Pro 


Gin 


370 






Leu Asn 


His 


His 


385 






Ala Gin 


Ala 


Leu 


Ala Gin 


Asp 


Pro 






420 


Leu Thr 


irne 


nlS 




435 




Asn Val 


Asp 


Met 


450 






Val Pro 


Arg 


Leu 


465 






Glu Arg 


Leu 


Lys 


Ser Arg 


Cys 


Ser 






500 


Gly Phe 


His 


Ser 




515 





Asn Lys 


Ser 


Asp 


85 






Asp Thr 


Cys 


Ser 


Leu Ala 


Lys 


Ala 






120 


Gin Tyr 


Gin 


Pro 




135 




Leu Ala 


Met 


Val 


150 






Val Ser 


Tyr 


Gly 


165 






Pro Ser 


Phe 


Phe 


Ala Ala 


Glu 


Leu 






200 


Gly Ser 


Asp 


Asp 




215 




Leu Ala 


Ala 


Ala 


230 






Leu Pro 


Arg 


Ala 


245 






His Gin 


Val 


Asn 


Val His 


Ala 


Ala 






280 


Ser Pro 


Lys 


Val 




295 




Val Met 


Gly 


Leu 


310 






Leu Gin 


Arg 


Gly 


325 






His Leu 


Ala 


Leu 


Arg Glu 


Gin 


Gly 






360 


Cys Asp 


Cys 


lie 




375 




Gin Thr 


Phe 


Ser 


390 






His Asn 


Thr 


Leu 


405 






Val Lys 


Pro 


Trp 


Val Gly 


Gly 


Leu 






440 


Glu Tyr 


Asp 


Leu 




455 




His Asp 


Val 


Gly 


470 






He Arg 


Trp 


His 


485 






Arg Gin 


Cys 


Gin 


Cys Cys 


Tyr 


Asp 



520 



Leu Leu Pro 


Gly 


90 




Glu Pro Val 


Val 


105 




Gly Ser Arg 


Asp 


Arg Val Leu 


Ala 




140 


Thr Gly Lys 


Pne 


155 




Ala Ser Met 


Glu 


170 . 




Arg Thr Val 


Pro 


185 




Leu Gin Glu 


Phe 


Glu Tyr Gly 


Arg 




220 


Arg Gly He 


Cys 


235 




Asp Asp Ser 


Arg 


250 




Gin Ser Ser 


Val 


265 




His Ala Leu 


Phe 


Trp Val Ala 


Ser 




300 


Pro Gly Met 


Ala 


315 




Ala Gin Leu 


His 


330 




Ala Thr Asp 


Pro 


345 




Leu Glu Glu 


Asp 


Thr Leu Gin 


Asn 




380 


Val Tyr Ala 


Ala 


395 




Gin Cys Asn 


Ala 


410 




Gin Leu Leu 


Glu 


425 




Pro Leu Arg 


Phe 


Lys Leu Trp 


Val 




460 


Arg Phe. Asn 


Gly 


475 




Thr Ser Asp 


Asn 


490 




Glu Gly Gin 


Val 


505 




Cys Val Asp 


Cys 


48 





Leu Arg 


Leu Gly 






Ala Met 


Lys Pro 


110 




He Ala 


Ala Tyz 


125 




Val He 


Gly Pro 


Phe Ser 


Phe Phe 




160 


Leu Leu 


Ser Ala 




175 


Ser Asp 


Arg Val 


190 




Gly Trp 


Asn Trp 


205 




Gin Giy 


Leu Ser 


He Al^ 


His Glu 




240 


Leu Gly 


Lys Val 




255 


Gin Val 


Val Leu 


270 




Asn Tyr 


Ser He 


285 




(31u Ala 


Trp Leu 


Gin Met 


Gly Thr 




320 


Glu Phe 


Pro Gin 




335 


Ala Phe 


Cys Ser 


350 




Val Val 


Gly Gin 


365 




Val Ser 


Ala Gly 


Val Tyr 


Ser Val 




400 


Ser Gly 


Cys Pro 




415 


Asn Met 


Tyr Asn 


430 




Asp Ser 


Ser Gly 


445 




Trp Gin 


Gly Ser 


Ser Leu 


Arg Thr 




480 


Gin Lys 


Pro Val 




495 


Arg Arg 


Val Lys 


510 




Glu Ala 


Gly Ser 


525 
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Tyr 


Arg 


Gin Asn 


Pro 


Asp 


Asp He Ala Cys 


Thr Phe Cys Gly Gin Asp 




530 








535 


540 


Qlu 


Trp 


Ser Pro 


Glu 


Arg 


Ser Thr Arg Cys 


Phe Arg Arg Arg Ser. Arg 


545 








550 




555 560 


Phe 


Leu 


Glu Trp 


Ser 


Asp 


He Glu Ser He 


He Ala He Ala Phe Ser 








565 




570 


575 


Cys 


Leu 


Gly He 


Leu 


val 


Thr Leu Phe Val 


Thr Leu He Phe Val Leu 






580 






585 


590 


Tyx 


Arg 


Asp Thr 


Pro 


Val 


Val Lys Ser Ser 


Ser Arg Qlu Leu Cys Tyr 






595 






600 


605 


lie 


He 


Leu Ala 


Gly 


He 


Phe Leu Gly Tyr 


Val Cys Pro Phe Thr Leu 




610 








615 


620 


He 


Ala 


Lys Pro 


Thr 


Thr 


Thr Ser Cys Tyr 


Leu Gin Arg Leu Leu Val 


625 








630 




635 640 


Gly 


Leu 


Ser Ser 


Ala 


Met 


Cys Tyr Ser Ala 


Leu Val Thr Lys Thr Asn 








645 




650 


655 


Arg 


He 


Ala Arg 


He 


Leu 


Ala Gly Ser Lys 


Lys Lys He Cys Thr Arg 






660 






665 


670 


Lys 


Pro 


Arg Phe 


Met 


Ser 


Ala Trp Ala Gin 


Val He He Ala Ser He 






675 






680 


685 


Leu 


He 


Ser Val 


Gin 


Leu 


Thr Leu Val Val 


Thr Leu He He Met Glu 




690 








695 


700 


Pro 


Pro 


Met Pro 


He 


Leu 


Ser Tyr Pro Ser 


He Lys Glu Val Tyr Leu 


705 








.710 




715 720 


lie 


Cys 


Asn Thr 


Ser 


Asn 


Leu Gly Val Val 


Ala Pro Val Gly Tyr Asn 








725 




730 


735 


Gly 


Leu 


Leu He 


Met 


Ser 


Cys Thr Tyr Tyr 


Ala Phe Lys Thr Arg Asn 






740 






745 


750 


Val 


Pro 


Ala Asn 


Phe 


Asn 


Glu Ala Lys Tyr 


He Ala Phe Thr Met Tyr 






755 






760 


765 


Thr 


Thr 


Cys He 


He 


Trp 


Leu Ala Phe Val 


Pro He Tyr Phe Gly Ser 




770 








775 


780 


Asn 


Tyr 


Lys He 


He 


Thr 


Thr Cys Phe Ala 


Val Ser Leu Ser Val Thr 


785 








790 




795 800 


Val 


Ala 


Leu Gly 


Cys 


Met 


Phe Thr Pro Lys 


Met Tyr He He He Ala 








805 




810 


815 


Lys 


Pro 


Glu Arg 


Asn 


Val 


Arg Ser Ala Phe 


Thr Thr Ser Asp Val Val 






820 






825 


830 


Arg 


Met 


His Val 


Gly 


Asp 


Gly Lys Leu Pro 


Cys Arg Ser Asn Thr Phe 






835 






840 


845 . 


Leu 


Asn 


He Phe 


Arg 


Arg 


Lys Lys Pro Gly 


Ala Gly Asn Ala Asn Ser 




850 








855 


860 


Asn 


Gly 


Lys Ser 


Val 


Ser 


Trp Ser Glu Pro 


Gly Gly Arg Gin Ala Pro 


865 








870 




875 880 


Lys 


Gly 


Gin His 


Val 


Trp 


Gin Arg Leu Ser 


Val His Val Lys Thr Asn 








885 




890 


895 


Glu 


Thr 


Ala Cys 


Asn 


Gin 


Thr Ala Val He 


Lys Pro Leu Thr Lys Ser 






900 






905 


910 


Tvx 


Gin 


Gly Ser 


Gly 


Lys 


Ser Leu Thr Phe 


Ser Asp Ala Ser Thr Lys 






915 






920 


925 


Thr 


Leu 


Tyr Asn 


Val 


Glu 


Glu Glu Asp Asn 


Thr Pro Ser Ala Hi is Phe 




930 








935 


940 


Ser 


Pro 


Pro Ser 


Ser 


Pro 


Ser Met Val Val 


His Arg Arg Gly Pro Pro 


945 








950 




955 . 960 


Val 


Ala 


Thr Thr 


Pro 


Pro 


Leu Pro Pro His 


Leu Thr Ala Glu Glu Thr 








965 




970 


975 
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Pro Leu Phe Leu Ala Asp Ser Val lie Pro Lys Gly Leu Pro Pro Pro 

980 985 990 

Leu Pro Gin Gin Gin Pro Gin Gin Pro Pro Pro Gin Gin Pro Pro Gin 

995 1000 1005 

Gin Pro Lys Ser Leu Met Asp Gin Leu Gin Gly Val Val Thr Asn Phe 

1010 1015 1020 

Gly Ser Gly He Pro Asp Phe His Ala Val Leu Ala Gly Pro Gly Thr 
1025 1030 . 1035 1040 

Pro Gly Asn Ser Leu Arg Ser Leu Tyr Pro Pro Pro. Pro Pro Pro Gin 

1045 1050 1055 

His Leu Gin Met Leu Pro Leu His Leu Ser Thr Phe Gin Glu Glu Ser 

1060 1065 1070 

He Ser Pro Pro Gly Glu Asp lie Asp Asp Asp Ser Glu Arg Phe Lys 

10.75 1080 1085 

Leu Leu Gin Glu Phe Val Tyr Glu Arg Glu Gly Asn Thr Glu Glu Asp 

1090 1095 1100 

Glu Leu Glu Glu Glu Glu Asp Leu Pro Thr Ala Ser Lys Leu Thr Pro 
1105 1110 1115 1120 

Glu Asp Ser Pro Ala Leu Thr Pro. Pro Ser Pro Phe Arg Asp Ser Val 

1125 1130 1135 

Ala Ser Gly Ser Ser Val Pro Ser Ser Pro Val Ser Glu Ser Val Leu 

1140 1145 1150 

Cys Thr Pro Pro Asn Val Thr Tyr Ala Ser Val He Leu Arg Asp Tyr 

1155 1160 1165 

Lys Gin Ser Ser Ser Thr Leu 
1170 1175 

<210> 31 
<211> 867 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; note » 
synthetic construct 

<400> 31 

Met Val Arg Leu Leu Leu He Phe ,Phe Pro Met He Phe Leu Glu Met 

1 5 10 15 

Ser He Leu Pro Arg Met Pro Asp Arg Lys Val Leu Leu Ala Gly Ala 

20 25 30 

Ser Ser Gin Arg Ser Val Ala Arg Met Asp Gly Asp Val He He Gly 

35 . 40 45 

Ala Leu Phe Ser Val His His Gin Pro Pro Ala Glu Lys Val Pro Glu 

50 55 60 

Arg Lys Cys Gly Glu He Arg Glu Gin Tyr Gly He Gin Arg Val Glu 
65 70 75 80 

Ala Met Phe His Thr Leu Asp Lys He Asn Ala Asp Pro Val Leu Leu 

} 85 90 95 

Pro Ai^n He Thr Leu Gly Ser Glu He Arg Asp Ser Cys Trp His Ser 

100 105 110 

Ser Val Ala Leu Glu Gin Ser He Glu Phe He Arg Asp Ser Leu lie 

115 120 125 

Ser He Arg Asp Glu Lys Asp Gly Leu Asn Arg Cys Leu Pro Asp Gly 

130 135 140 

Gin Thr Leu Pro Pro Gly Arg Thr Lys Lys Pro He Ala Gly Val He 
145 150 155 160 
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Gly Pro Gly Ser Ser Ser Val Ala He Gin Val Gin Asn Leu Leu Gin 

165 170 175 

Leu Phe Asp He Pro Gin He Ala Tyr Ser Ala Thr Ser He Asp Leu 

180 185 190 

Ser Asp Lys Thr Leu Tyr Lys Tyr Phe Leu Arg Val Val Pro Ser Asp 

195 200 205 

Thr Leu Gin Ala Arg Ala Met Leu Asp He Val Lys Arg Tyr Asn Trp 

210 215 220 

Thr Tyr Val Ser Ala Val His Thr Glu Gly Asn Tyr Gly Glu Ser Gly 
225 230 235 240 

Met Asp Ala Phe Lys Glu Leu Ala Ala Gin Glu Gly Leu Cys He Ala 

245 250 255 

His Ser Asp Lys He Tyr Ser Asn Ala Gly Glu Lys Ser Phe Asp Arg 

260 265 270 

Leu Leu Arg Lys Leu Arg Glu Arg Leu Pro Lys Ala Arg Val Val Val 

275 280 285 

Cys Phe cys Glu Gly Met Thr Val Arg Gly Leu Leu Ser Ala Met Arg 

290 295 300 

Arg Leu Gly Val Val Gly Glu Phe Ser Leu He Gly Ser Asp Gly Trp 
305 310 315 320 

Ala Asp Arg Asp Glu Val He Glu Gly Tyr Glu Val Glu Ala Asn Gly 

325 330 335 

Gly He Thr He Lys Leu Gin Ser Pro Glu Val Arg Ser Phe Asp Asp 

340 345 350 

Tyr Phe Leu Lys Leu Arg Leu Asp Thr Asn Thr Arg Asn Pro Trp Phe 

355 360 365 

Pro Glu Phe Trp Gin His Arg Phe Gin Cys Arg Leu Pro Gly His Leu 

370 375 380 

Leu Glu Asn Pro Ash Phe Lys Lys Val Cys Thr Gly Asn Glu Ser Leu 
385 390 395 400 

Glu Glu Asn Tyr Val Gin Asp Ser Lys Met Gly Phe Val He Asn Ala 

405 410 415 

He Tyr Ala Met Ala His Gly Leu Gin Asn Met His His Ala Leu Cys 

420 425 430 

Pro Gly His Val Gly Leu Cys Asp Ala Met Lys Pro He Asp Gly Arg 

.435 440 445 

Lys Leu Leu Asp Phe Leu He Lys Ser Ser Phe Val Gly Val Ser Gly 

450 455 460 

Glu Glu Val Trp Phe Asp Glu Lys Gly Asp Ala Pro Gly Arg Tyr Asp 
465 470 475 480 

He Met Asn Leu Gin Tyr Thr Glu Ala Asn Arg Tyr Asp Tyr Val His 

485 490 495 

Val Gly Thr Trp His Glu Gly Val Leu Asn lie Asp Asp Tyr Lys He 

500 505 510 

Gin -Met Asn Lys Ser Gly Met Val Arg Ser Val Cys Ser Glu Pro. Cys 

515 520 525 

Leu Lys Gly Gin He Lys Val He Arg Lys Gly Glu Val Ser Cys Cys 

.530 535 540 

Trp He Cys Thr Ala Cys Lys Glu Asn Glu Phe Val Gin Asp Glu Phe 
545 550 555 560 

Thr Cys Arg Ala Cys Asp Leu Gly Trp Trp Pro Asn Ala Glu Leu Thr 

565 570 575 

Gly Cys Glu Pro He Pro Val Arg Tyr Leu Glu Leu Arg Glu His Thr 

580 585 590 

Ser Trp Val Leu Leu Ala Ala Asn Thr Leu Leu Leu Leu Leu Leu Leu 

595 600 605 

Gly Thr Ala Gly Leu Phe Ala Trp His Leu Asp Thr Pro Val Val Arg 
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610 615 620 

Ser Ala Gly Gly Arg Leu Cys Phe Leu Met Leu Gly Ser Leu Ala Ala 
625 630 635 640 

Gly Ser Gly Ser Leu Tyr Gly Phe Phe Gly Glu Pro Thr Arg Pro Ala 

645 650 655 

Cys Leu Leu Arg Gin Ala Leu Phe Ala Leu Gly Phe Thr He Phe Leu 

660 665 670 

Ser Cys Leu Thr Val Arg Ser Phe Gin Leu He He He Phe Lys Phe 

675 680 685 

Ser Thr Lys Val Pro Thr Phe Tyr His Ala Trp Val Gin Asn His Gly 

690 695 700 

Ala Gly Leu Phe Val Met He Ser Ser Ala Ala Gin Leu Leu He Cys 
705 710 715 720 

Leu Thr Trp Leu Val Val Trp Thr Pro Leu Pro Ala Arg Glu Tyr Gin 

725 730 735 

Arg Phe Pro His Leu Val Met Leu Glu Cys Thr Glu Thr Asn Ser Leu 

740 745 750 

Gly Phe He Leu Ala Phe Leu Tyr Asn Gly Leu Leu Ser He Ser Ala 

755 760 765 

Phe Ala Cys Ser Tyr Leu Gly Lys Asp Leu Pro Glu Asn Tyr Asn Glu 

770 775 780 

Ala Lys Cys Val Thr Phe Ser Leu Leu Phe Asn Phe Val Ser Trp He 
785 790 795 800 

Ala Phe Phe Thr Thr Ala Ser Val Tyr Asp Gly Lys Tyr Leu Pro Ala 

805 810 815 

Ala Asn Met Met Ala Gly Leu Ser Ser Leu Ser Ser Gly Phe Gly Gly 

820 825 830 

Tyr Phe Leu Pro Lys Cys Tyr Val He Leu Cys Arg Pro Asp Leu Asn 

835 840 845 

Ser Thr Glu His Phe Gin Ala Ser He Gin Asp Tyr Thr Arg Arg Cys 

850 855 860 

Gly Ser Thr 
865 

<210> 32 
<211> 866 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; note = 
synthetic construct 



<400> 32 
















Met Val 


Arg 


Leu 


Leu 


Leu 


He 


Phe Phe 


Pro Met 


He Phe Leu Glu Met 


1 






5 








10 


15 


Ser He 


Leu 


Pro 


Arg 


Net 


Pro 


Asp Arg 


Lys Val 


Leu Leu Ala Gly Ala 






20 








25 




30 


Ser Ser 


Gin 


Arg 


Ser 


Val 


Ala 


Arg Met 


Asp Gly 


Asp Val He He Gly 




35 










40 




45 


Ala Leu 


Phe 


Ser 


Val 


His 


His 


Gin Pro 


Pro Ala 


Glu Lys Val Pro Glu 


50 










55 






60 


Arg Lys 


Cys 


Gly 


Glu 


He 


Arg 


Glu Gin 


Tyr Gly 


He Gin Arg Val Glu 


.65 








70 






75 


80 


Ala Met 


Phe 


His 


Thr 


Leu 


Asp 


Lys He 


Asn Ala 


Asp Pro Val Leu Leu 








85 








90 


95 


Pro Asn 


He 


Thr 


Leu 


Gly 


Ser 


Glu He 


Arg Asp 


Ser Cys Trp His Ser 
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100 105 110 

Ser Val Ala Leu Glu Gin Ser lie Glu Phe lie Arg Asp Ser Leu He 

115 120 125 

Ser He Arg Asp Glu Lys Asp Gly Leu Asn Arg. Cys lieu Pro Asp Gly 

130 135 140 

Gin Thr Leu Pro Pro Gly Arg Thr Lys Lys Pro He Ala Gly Val He 
145 150 155 160 

Gly Pro Gly Ser Ser Ser Val Ala He Gin Val Gin Asn Leu Leu Gin 

165 170 175 

Leu Phe Asp He Pro Gin He Ala Tyr Ser Ala Thr Ser He Asp Leu 

180 185 190 

Ser Asp Lys Thr Leu Tyr Lys Tyr Phe Leu Arg Val Val Pro Ser Asp 

195 200 205 

Thr Leu Gin Ala Arg Ala Met Leu Asp He Val Lys Arg Tyr Asn Trp 

210 215 220 

Thr Tyr Val Ser Ala Val His Thr Glu Gly Asn Tyr Gly Glu Ser Gly 
225 230 235 240 

Met Asp Ala Phe Lys Glu Leu Ala Ala Gin Glu Gly Leu Cys He Ala 

245 250 255 

His Ser Asp Lys He Tyr Ser Asn Ala Gly. Glu Lys Ser Phe Asp Arg 

260 265 270 

Leu Leu Arg Lys Leu Arg Glu Arg Leu Pro Lys Ala Arg Val Val Val 

275 280 285 

Cys Phe Cys Glu Gly Met Thr Val Arg Gly Leu Leu Ser Ala Met Arg 

290 295 300 

Arg Leu Gly Val Val Gly Glu Phe Ser Leu He Gly Ser Asp Gly Trp 
305 310 315 320 

Ala Asp Arg Asp Glu Val He Glu Gly Tyr Glu Val Glu Ala Asn Gly 

325 330 335 

Gly He Thr He Lys Leu Gin Ser Pro Glu Val Arg Ser Phe Asp Asp 

340 345 350 

Tyr Phe Leu Lys Leu Arg Leu Asp Thr Asn Thr Arg Asn Pro Trp Phe 

355 360 365 

Pro Glu Phe Trp Gin His Arg Phe Gin Cys Arg Leu Pro Gly His Leu 

370 375 380 

Leu Glu Asn Pro Asn Phe Lys Lys Val Cys Thr Gly Asn Glu Ser Leu 
385 390 395 400 

Glu Glu Asn Tyr Val Gin Asp Ser Lys Met Gly Phe Val He Asn Ala 

405 410 415 

He Tyr Ala Met Ala His Gly Leu Gin Asn Met His His Ala Leu Cys 

420 425 430 

Pro Gly His Val Gly Leu Cys Asp Ala Met. Lys Pro He Asp Gly Arg 

435 440 445 

Lys Leu Leu Asp Phe Leu He Lys Ser Ser Phe Val Gly Val Ser Gly 

450 455 460 

Glu Glu Val Trp Phe Asp Glu Lys Gly Asp Ala Pro Gly Arg Tyr Asp 
465 470 475 480 

He Met Asn Leu Gin Tyr Thr Glu Ala Asn Arg Tyr Asp Tyr Val His 

485 490 . 495 

Val Gly Thr Trp His Glu Gly Val Leu Asn He Asp Asp. Tyr Lys He 

500 505 510 

Gin Met Asn Lys Ser Gly Met Val Arg Ser Val Cys Ser Glu Pro Cys 

515 520 525 

Leu Lys Gly Gin He Lys Val He Arg Lys Gly Glu Val Ser Cys Cys 

530 535 540 

Trp He Cys Thr Ala Cys Lys. Glu Asn Glu Phe Val Gin Asp Glu Phe 
545 550 555 560 
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Thr Cys Arg Ala Cys Asp Leu Gly Trp Trp Pro Asn Ala Glu Leu Tlir 

565 570 575 

Gly Cys Glu Pro lie Pro Val Arg Tyr Leu Glu Trp His Glu Ala Pro 

580 585 590 

Thr lie Ala Val Ala Leu Leu Ala Ala Leu Gly Phe Leu Ser Thr Leu 

595 600 605 

Ala He Leu Val He Phe Trp Arg His Phe Gin Thr Pro He Val Arg 

610 615 620 

ser Ala Gly Gly Pro Met Cys Phe Leu Met Leu Thr Leu Leu Leu Val 
625 630 635 640 

Ala Tyr Met Val Val Pro Val Tyr Val Gly Pro Pro Lys Val Ser Thr 

645 650 655 

Cys Leu Cys Arg Gin Ala Leu Phe Pro Leu Cys Phe Thr He Cys He 

660 665 670 

Ser Cys He Ala Val Arg Ser Phe Gin He Val Cys Ala Phe Lys Met 

675 680 685 

Ala Ser Arg Phe Pro Arg Ala Tyr Ser Tyr .Trp Val Arg Tyr Gin Gly 

690 695 700 

Pro Tyr Val Ser Met Ala Phe He Thr Val Leu Lys Met Val He Val 
705 710 715 720 

Val He Gly Met Leu Ala Thr Gly Leu Ser Pro Thr Thr Arg Thr Asp 

725 730 735 

Pro Asp Asp Pro Lys He Thr He Val Ser Cys Asn Pro Asn Tyr Arg 

740 745 750 

Asn Ser Leu Leu Phe Asn Thr Ser Leu Asp Leu Leu Leu Ser Val Val 

755 760 765 

Gly Phe Ser Phe Ala Tyr Met Gly Lys Glu Leu Pro Thr Asn Tyr Asn 

770 775 780 

Glu Ala Lys Phe He Thr Leu Ser Met Thr Phe Tyr Phe Thr Ser Ser 
785 790 795 800 

Val Ser Leu Cys Thr Phe Met Ser Ala Tyr Ser Gly Val Leu Val Thr 

605 810 815 

He Val Asp Leu Leu Val Thr Val Leu Asn Leu Leu Ala He Ser Leu 

820 825 830 

Gly Tyr Phe Gly Pro Lys Cys Tyr Met He Leu Phe Tyr Pro Glu Arg 

835 840 845 

Asn Thr Pro Ala Tyr Phe Asn Ser Met He Gin Gly Tyr Thr Met Arg 

850 855 . 860 

Arg Asp 
865 

<210> 33 
<211> 876 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; note = 
synthetic construct 

<400> 33 

Met Val Arg Leu Leu Leu He Phe Phe Pro Met He Phe Leu Glu Met 

1 5 10 15 

Ser He Leu Pro Arg Met Pro Asp Arg Lys Val Leu Leu Ala Gly Ala 

20 25 30 

Ser Ser Gin Arg Ser Val Ala Arg Met Asp Gly Asp Val He He Gly 
35 40 45 
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Ala Leu Phe Ser Val His His Gin Pro Pro Ala Glu Lys Val Pro Glu 

50 55 60 

Arg Lys Cys Gly Glu He Arg Glu Gin Tyr Gly He Gin Arg Val Glu 
65 70 75 80 

Ala Met Phe His Thr Leu Asp Lys He Asn Ala Asp Pro Val Leu Leu 

85 90 95 

Pro Asn He Thr Leu Gly Ser Glu He Arg Asp Ser Cys Trp His Ser 

100 105 HO 

Ser Val Ala Leu Glu Gin Ser He Glu Phe He Arg Asp Ser Leu He 

115 120 125 

Ser He Arg Asp Glu Lys Asp Gly Leu Asn Arg Cys Leu Pro Asp Gly 

130 135 140 

Gin Thr Leu Pro Pro Gly Arg Thr Lys Lys Pro He Ala Gly Val lie 
145. 150 155 160 

Gly Pro Gly Ser Ser Ser Val Ala He Gin Val Gin Asn Leu Leu Gin 

165 170 175 

Leu Phe Asp He Pro Gin He Ala Tyr Ser Ala Thr Ser He Asp Leu 

180 185 190 

Ser Asp Lys Thr Leu Tyr Lys Tyr Phe Leu Arg Val Val Pro Ser Asp 

195 200 205 

Thr Leu Gin Ala Arg Ala Met Leu Asp He Val Lys Arg Tyr Asn Trp 

210 215 220 

Thr Tyr Val Ser Ala Val His Thr Glu Gly Asn Tyr Gly Glu Ser Gly 
225 230 235 240 

Met Asp Aia Phe Lys Glu Leu Ala Ala Gin Glu Gly Leu Cys He Ala 

245 250 255 

His Ser Asp Lys He Tyr Ser Asn Ala Gly Glu Lys Ser Phe Asp Arg 

260 265 270 

Leu Leu Arg Lys Leu Arg Glu Arg Leu Pro Lys Ala Arg Val Val Val 

275 280 285 

Cys Phe Cys Glu Gly Met Thr Val Arg Gly Leu Leu Ser Ala Met Arg 

290 295 300 

Arg . Leu Gly Val Val Gly Glu Phe Ser Leu He Gly Ser Asp Gly Trp 
305 310 315 320 

Ala Asp Arg Asp Glu Val He Glu Gly Tyr Glu Val Glu Ala Asn Gly 

325 330 335 

Gly lie Thr He Lys Leu Gin Ser Pro Glu Val Arg Ser Phe Asp Asp 

340 345 350 

Tyr Phe Leu Lys Leu Arg Leu Asp Thr Asn Thr Arg Asn Pro Trp Phe 

355 360 365 

Pro Glu Phe Trp Gin His Arg Phe Gin Cys Arg Leu Pro Gly His Leu 

370 375 380 

Leu Glu Asn Pro Asn Phe Lys Lys Val Cys Thr Gly Asn Glu Ser Leu 
385 390 395 400 

Glu Glu Asn Tyr Val Gin Asp Ser Lys Met Gly Phe Val He Asn Ala 

405 410 415 

He Tyr Ala Met Ala His Gly Leu Gin Asn Met His His Ala Leu Cys 

420 425 430 

Pro Gly His Val Gly Leu Cys Asp Ala Met Lys Pro He Asp Gly Arg 

435 440 445 

Lys Leu Leu Asp Phe Leu He Lys Ser Ser Phe Val Gly Val Ser Gly 

450 455 460 

Glu Glu Val Trp Phe Asp Glu Lys Gly Asp Ala Pro Gly Arg Tyr Asp 
465 470 475 480 

He Met Asn Leu Gin Tyr Thr Glu Ala Asn Arg Tyr Asp Tyr Val His 

485 490 495 

Val Gly Thr Trp His Glu Gly Val Leu Asn He Asp Asp Tyr Lys He 
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r 
















505 










510 








neu 


Asn 


Lys 


Got* 

oer 


viXjr 


i*icu 


Val 

VCbX 


Axg 


OCX 


Val 


uys 


OCX 


Glu 










D ±D 










520 










525 








X16II 


Lys 


ur±y 


m n 
(aJLZl 


lie 


Jjjr 0 


VCLX 


Tie 
xxc 


*ixg 


Lys 




Glu 


Val 


Ser 


Cys 






do u 










3<9 w 






















lie 


Cys 




Axel 






wX U 


Asn 


Glu 




Val 


Gin 


Asp 


Glu 
























555 










560 


1X1 £^ 


Cys 


Arc[ 


i\xa 








V7xy 


irp 


lirp 


f X \J 


Asn 


Ala 


Glu 


Leu 


XllX 










565 










570 










575 






Cys 


VjJ.U 


Pro 


X xc 


Pro 


vcix 




lyiT 


Leu 


ni n 

wX u 


ixp 


Glv 


Glu 


Pro 


Ala 








sou 










3 0 3 










590 






VClX 


ijeu 


Leu 


Leu 


Leu 


■ 

Leu 


Leu 


Leu 


oex 


XIBLL 


2X1 A 




wxy 




Val 
vox 


XICU. 
















Ann 










D V9 








Ax a 


Aia 


Leu 


viiy 


Leu 


irne 


vai 


nxS 


XlxS 


Arg 


Asp 


sex 


Pro 


xieu 


Val 


Gl n 

wXU 




DIU 










0 x3 










WAV 










Aid. 


Ser 


Giy 


Giy 


Pro 


Leu 


Aia 


Cys 


irne 


v»iy 


Leu 


Val 


Cys 


Leu 


uxy 


Leu 


cm 










0 J u 










Q J9 










04 u 


vai 


Cys 


Leu 


CP A<w> 

ser 


vai 


Leu 


Leu 


pne 


pro 


vjiy 


(9in 


Pro 


ber 


Pro 


lil a 

AX a 


Arg 










£ A c 










-09U 














cys 


Leu 


Ala 


oin 


1*^1 

c^in 


Fro 


Leu 


Ser 


iixS 


Leu 


Pro 


Leu 


Tnr 


f2l \/- 

vaiy 


Cys 


Leu 








cct\ 
ooU 










OOD 










S7ft 






Ser 


Thr 


Leu 


PJie 


Leu 


n1 -ri 

Qin 


AT S3 

Ala 


TVl sa 

Ala 


CjIU 


lie 


Pue 


vai 


V3IU 


Ser 


V7IU 


Leu 
















boU 










DOS 




■ 




Pro 


Leu 


Ser 


Trp 


Ala 


Asp 


TV 

Arg 


Leu 


Ser 


(aiy 


Cys 


Leu 


Arg 


vafiy 


Pro 


Trp 




f 0 n 




















7ftft 










Ala 


Trp 


Leu 


vai 


vai 


Leu 


Leu 


Aia 


Met 


Leu 


vax 


VsXU 


vai 


ai a 
AX a 


Leu 


Cys 












./ XU 










71 R 

/ X3 










79 0 


Thr 


Trp 


Tyr 


Leu 


vai 


Aia 


oh A 

JrOe 


Pro 


pro 


urlu 


vai 


Va1 

vai 


inr 


Asp 


Trp 


nxS 










TOR 










Tin 










7"^ 




Met 


.Leu 


Pro 


Tnr 


QIU 


Aia 


Leu 


-vai 


IT -I 0 ' 

xlxS 


Cys 


Arg 


inr 


Arg 


Ser 


Trp 


Val 

vai 




























7<;ft 






Ser 


Pxie 


Qiy 


Leu 


Aia 


xlxS 


Axa 


inr 


Asn 


2i1 a 
AX a 


XUX 


Leu 


&1 a 

Axa 




Leu 


Cys 






*7 I? 




















765 










Leu 


ijiy 


iiir 


irUe 


Leu 


xra 1 

Va.X 


Arg 






Pro 


fSl X/ 


Arg 


Tyr 


AOXX 


Arg 




•7*70 










lis 










780 










Ala 


Arg 


viiy 


Leu 


inr 


rrUe 


al a 
AX a. 




Leu 


a 

AXa 


Tyr 


iruxs 


Tl 
XXC 


XAx 




Val 


*? Q c 










7Qft 










/ 79 










fiOO 


Ser 


Ptie 


vai 


Pro 


Leu 


Leu 


&1 a 

Aia 


Asn 


T7a 1 

. vai 


r!l n 


vai 


Ual 

vai 


Leu 


Arg 


Pro 


2il a 
AXa 










805 










810 










815 




val 


Gin 


Met 


Gly 


Ala 


Leu 


Leii 


Leu 


Cys 


Val 


Leu 


Gly 


He 


Leu 


Ala 


Ala 








620 










825 










830 






Phe 


His 


Leu 


Pro 


Arg 


Cys 


Tyr 


Leu 


Leu 


Met 


Arg 


Gin 


Pro 


Gly 


Leu 


Asn 






835 










840 










845 








Thr 


Pro 


Glu 


Phe 


Phe 


Leu 


Gly 


Gly 


Gly 


Pro 


Gly 


Asp 


Ala 


Gin 


Gly 


Gin 




850 










855 










860 










Asn 


Asp 


Gly 


Asn 


Thr 


Gly 


Asn 


Gin 


Gly 


Lys 


His 


Glu 











865 . 870 875 
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